COMPOSITIONS AND METHODS FOR INCREASING AMINO ACID 
ABSORPTION IN MAMMALS 



Cross-Reference to Related Applications 

This application claims the benefit of U.S. Provisional Application Serial 
Number 60/273,263, filed March 2, 2001, under 35 U.S.C. 1 19(e) and U.S. 
Provisional Application Serial Number 60/344,088, filed December 26, 2001, xmder 
35 U.S.C. 119(e). 

Background of the Invention 

In dogs, it is thought that the abihty to absorb essential amino acids such as 
tryptophan and leucine may be limiting to cellular metabolism. Recent research 
designed to characterize the amino acid absorption capacity of the brush border 
(lumen facing) membranes of dog enterocytes suggests that peptide absorption may 
be particularly important given the relatively low amount of firee amino acid 
transport capacity that was observed. BuddingtonRK, Paulsen DB. Development 
of the Canine and Feline Gastrointestinal Tract. In: Reinhart GA, Carey DP, eds. 
Recent Advances in Canine and Feline Nutrition, Vol. II: 1998 lams Nutrition 
Symposium Proceedings, Wilmington: Orange Frazer Press, 1998; 195-215, Data 
collected from studies designed to understand the quantitative importance of free 
versus peptide amino acids in other monogastric animals strongly indicates that 
peptide-bound amino acids account for the majority of amino acids absorbed by 
enterocytes from the intestinal lumen (Matthews, DM. Protein Absorption, 
Development and Present State of the Subject, New York: Wiley-Liss, 1991.) and 
that the rate of peptide-derived amino acid absorption is faster than that by 
equivalent amounts of free amino acids. Ohkohchi N, Andoh T, Ohi R, Mori S. 
Defined formula diets alter characteristics of the intestinal transport of amino acid 
and peptide in growing rats. JPediatr Gastroenterol Nutr 1990 May; 10(4):490-6. 

Two types of peptide transporters have been cloned from monogastric 
animals. Liang R, Fei YJ, Prasad PD, Ramamoorthy S, Han H, Yang-Feng TL, 
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Hediger MA, Ganapathy V, Leibach FH. Human intestinal H+/peptide 
cotransporter. Cloning, functional expression, and chromosomal localization. J 
Biol Chem 1995 Mar 24; 270(12):6456-63. Liu W, Liang R, Ramamoorthy S, Fei 
YJ, Ganapathy ME, Hediger MA, Ganapathy V, Leibach FH. Molecular cloning of 
5 PEPT 2, a new member of the H+/peptide cotransporter family, from human kidney. 
Biochim Biophys Acta 1995 May 4; 1235(2):461-6. PepTl is an HMependent, 
low-affinity (mM), high- velocity, transporter that is predominately localized 
primarily to the brush border membranes of mature enterocytes of intestinal villi. 
PepT2 is an H^-dependent, high-affinity (|aM), low-velocity, transporter that is 
1 0 expressed in the greatest abundance in the apical membranes of renal proximal 
tubular epithelial cells. An important feature of the peptide transporters is their 
ability to recognize and transport most di- and tripeptides, albeit with a range of 
relative affinities for different peptides. In addition, both transporters recognize the 
p-lactam antibiotics, and carboxyl-terminal modified free amino acids. The 
1 5 physiologic functions of these transporters are thought to be to absorb di- and 

tripeptides from the digesta and from the blood, respectively. Although molecular 
evidence has not been acquired, there is strong biochemical evidence for a different 
peptide transport protein that functions in the basolateral membrane of these cells. 
Saito H, Inui KI. Dipeptide trmsporters in apical and basolateral membranes of the 
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t 20 human intestinal cell line Caco-2. Am J Physiol 1 993 Aug; 265(2 Pt l):G289-94. 



fy Thwaites DT, Brown CD, Hirst BH, Simmons NL. Transepithelial glycylsarcosine 

transport in intestinal Caco-2 cells mediated by the expression of ff-coupled 
carriers at both the apical and basal membranes. J Biol Chem 1993 Apr 15; 
268(1 1):7640-2. 

25 Research with Caco-2 cells indicates that PepTl transporter mRNA, protein, 

and activity increases in a manner consistent with a direct effect of increased 
extracellular substrate concentrations. Walker D, Thwaites DT, Simmons NL, 
Gilbert HJ, Hirst BH. Substrate upregulation of the human small intestinal peptide 
transporter, hPepTl. JP/zy^zo/ 1998 Mar 15; 507(Pt 3):697-706. In contrast to 

30 mRNAs for essential amino acid transporters, intestinal studies show that the 
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expression of peptide transporter mRNA increases in response to increased dietary 
protein. Erickson RH, Gum JR Jr, Lindstrom MM, McKean D, Kim YS. Regional 
expression and dietary regulation of rat small intestinal peptide and amino acid 
transporter mRNAs. Biochem Biophys Res Commun 1995 Nov 2; 216(1):249'57. 
Similarly, expression in intestinal mucosa of PepTl mRNA and protein increases in 
response to tissue trauma, whereas the mRNA for essential amino acid transporters 
decreases. Tanaka H, Miyamoto KI, Morita K, Haga H, Segawa H, Shiraga T, 
Fujioka A, Kuoda T, Taketani Y, Hisano S, Fukui Y, Kitagawa K, Takeda E. 
Regulation of the PepTl peptide transporter in the rat small intestine in response to 
5-fluorouracil-induced injury. Gastroenterology 1998 Apr; 114(4):714-23. 

Few studies have been conducted to evaluate the potential for the dog to 
absorb quantitatively significant amounts of essential amino acids in the foim of 
small peptides, and whether this capacity can be regulated by substrate supply. 
Accordingly, there is still a need to evaluate the potential for the absorption of 
peptide-bound leucine and tryptophan by putative canine peptide transporters. It 
would thus be desirable to provide the nucleic acid sequence encoding canine 
PepTl . It would also be desirable to provide mRNA transcripts corresponding to 
cPepTl. It would further be desirable to characterize the function of cPepTl by 
GlySar uptake and identify di- and tripeptides well recognized by cPepTl, as well as 
characterize the effect of supplemental peptide substrate on the transport capacity of 
canine PepTl (cPepTl), 

Summary of the Invention 

The present invention provides novel isolated and purified nucleic acids 
(RNA or DNA) encoding, or complementary to, canine PepTl (cPepTl). The 
nucleic acid may be SEQ ID N0:7, SEQ ID N0:8, SEQ ID N0:9, or SEQ ID 
NO:20 or may be a nucleic acid that hybridizes under moderate or stringent 
hybridization conditions to any of these sequences. Also provided are peptides 
encoded by these nucleic acids, such as SEQ ID N0:13 or SEQ ID N0:2L 



The present invention also provides a method for determining canine PepTl- 
transportability of a peptide, or method for detemiining a peptide with beneficial 
nutritional property in an animal, comprising providing an immortalized kidney 
distal tubule epitheUal (Madin-Darby Canine Kidney (MDCK)) cell and a peptide 
having 2 to 10 amino acids, and determining the amount of the peptide transported 
into the cell, wherein the amount correlates with the canine PepTl -transportability 
of the peptide. A peptide with beneficial nutritional properties in an animal is a 
peptide that contains at least one essential amino acid that is absorbed at a rate 
higher than the rate of the amino acid if it were free rather than in a peptide-bound 
form. The peptide may be a dipeptide, tripeptide, or tetrapeptide such as, for 
example, GlySar, GlyGly, AlaHis, 6-AlaHis (camosine), GlnGln, GlyMet, LeuMet, 
LeuTrp, MetLeu, MetMet, MetPhe, MetPro, TrpLeu, TrpTrp, GlnGlu, MetGlu, 
MetLys, TrpGiy, MetGlyMetMet (SEQ ID NO: 10), TrpGlyGly, LeuArg, ArgLeu, 
GlyLeu, or ArgTrp. The cell used in the method may be in medixmi at a pH of 
between about 5 and 8; or at a pH of about 5.5 to 7.5, or even at about 6 to 6.5. The 
peptide may be present at a concentration of about 10 nm to about 50 mM. 

The characterization of GlySar uptake by immortalized MDCK cells 
demonstrates that MDCK cells express PepTl-like activity, confirming detection of 
PepTl niRNA expression by MDCK cells and the use of MDCK cells as a model to 
characterize the biochemical function of canine PepTl . 

The cPepTl of the present invention is also capable of recognizing a variety 
of di- and tripeptides, including those that contain the essential amino acids leucine 
and tryptophan, considered to be of especial importance to canine nutrition. In 
addition, BT-dependent peptide transport in cultured MDCK cells can be stimulated 
by at least two of PepTl substrates, GlySar and camosine. Moreover, ff-dependent 
uptake of GlySar by MDCK is sensitive to nutrient deprivation and Insulin-like 
Growth factor I (IGF-I). 

The present invention further provides a dietary composition with improved 
nutritional benefit for an animal comprising at least one peptide identified by the 
method described above. 



The present invention provides a process for altering the absorption of 
essential amino acids in an animal, such as a dog, comprising the steps of feeding 
the animal a diet containing the dietary composition described above; and 
maintaining the animal on the diet for a sufficient period of time to allow the 
5 composition to be absorbed by the digestive system of the animal. The diet may 
comprise about 20 to about 30% crude protein, about 10 to about 20% fat, and about 
3 to about 10%) dietary fiber. 

As used herein, the term "cPepTl" includes variants or biologically active or 
inactive fragments of this transport protein. A 'Variant" of the polypeptide is a 
10 cPepTl protein that is not completely identical to a native cPepTl protein. A 
variant cPepTl protein can be obtained by altering the amino acid sequence by 
insertion, deletion or substitution of one or more amino acid. The amino acid 
sequence of the protein is modified, for example by substitution, to create a 
O polypeptide having substantially the same or improved qualities as compared to the 

^ 15 native polypeptide. The substitution may be a conserved substitution. A 

"conserved substitution" is a substitution of an amino acid with another amino acid 
having a similar side chain. A conserved substitution would be a substitution with 
an amino acid that makes the smallest change possible in the charge of the amino 
acid or size of the side chain of the amino acid (alternatively, in the size, charge or 
20 kind of chemical group within the side chain) such that the overall peptide retains its 
spacial conformation but has altered biological activity. For example, common 
conserved changes might be Asp to Glu, Asn or Ghi; His to Lys, Arg or Phe; Asn to 
Gin, Asp or Glu and Ser to Cys, Thr or Gly. Alanine is commonly used to substitute 
for other amino acids. The 20 common amino acids can be grouped as follows: 
25 alanine, valine, leucine, isoleucine, proline, phenylalanine, tryptophan and 

methionine having nonpolar side chains; glycine, serine, threonine, cystine, tyrosine, 
asparagine and glutamine having uncharged polar side chains; aspartate and 
glutamate having acidic side chains; and lysine, arginine, and histidine having basic 
side chains. Stryer, L. Biochemistry (2d edition) W. H. Freeman and Co. San 
30 Francisco (1981), p. 14-15; Lehninger, A. Biochemistry (2d ed., 1975), p. 73-75. It 
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is known to those of skill in the art that transport of other, less common, amino acids 
such as hydroxylysine, or derivatives of any one of the 20 common amino acids 
listed above would also be within the scope of this invention. 

It is known that variant polypeptides can be obtained based on substituting 
5 certain amino acids for other amino acids in the polypeptide structure in order to 
modify or improve biological activity. For example, through substitution of 
altemative amino acids, small conformational changes may be conferred upon a 
polypeptide that result in increased bioactivity. Altematively, amino acid 
substitutions in certain polypeptides may be used to provide residues that may then 
10 be linked to other molecules to provide peptide-molecule conjugates that retain 
sufficient properties of the starting polypeptide to be useful for other purposes. 

One can use the hydropathic index of amino acids in conferring interactive 
biological function on a polypeptide, wherein it is found that certain amino acids 
may be substituted for other amino acids having similar hydropathic indices and still 
15 retain a similar biological activity. Altematively, substitution of like amino acids 
may be made on the basis of hydrophiUcity. It is noted that substitutions can be 
made based on the hydrophihcity assigned to each amino acid. In using either the 
hydrophilicity index or hydropathic index, which assigns values to each amino acid, 

S it is preferred to conduct substitutions of amino acids where these values are ±2, 
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20 with ±1 being particularly preferred, and those with in ±0.5 being the most preferred 
substitutions. 

The variant cPepTl protein comprises at least seven amino acid residues, 
preferably about 20 to about 700 residues, and more preferably about 50 to about 
700 residues, wherein the variant cPepTl protein has at least 50%, preferably at 
25 least about 80%, and more preferably at least about 90% but less than 100%, 

contiguous amino acid sequence homology or identity to the amino acid sequence of 
a corresponding native cPepTl protein. 

The amino acid sequence of the variant cPepTl protein corresponds 
essentially to the native cPepTl protein amino acid sequence. As used herein 
30 "correspond essentially to" refers to a polypeptide sequence that will eUcit an 



absorption value substantially the same as the absorption stimulated by native 
cPepTl protein. Such absorption may be at least 60% of the level generated by 
native cPepTl protein, and may even be at least 80% of the level generated by 
native cPepTl protein. 
5 A variant of the invention may include amino acid residues not present in the 

corresponding native cPepTl protein, or may include deletions relative to the 
corresponding native cPepTl protein. A variant may also be a truncated "fragment" 
as compared to the corresponding native cPepTl protein, Le,, only a portion of a 
full-length protein. cPepTl protein variants also include peptides having at least 
1 0 one D-amino acid. 

The cPepTl protein of the present invention may be expressed from an 
isolated nucleic acid (DNA or RNA) sequence encoding the cPepTl protein. Amino 
% acid changes fi"om the native to the variant cPepTl protein may be achieved by 

2 changing the codons of the corresponding nucleic acid sequence. "Recombinant" is 

H 1 5 defined as a peptide or nucleic acid produced by the processes of genetic 

S engineering. It should be noted that it is well-known in the art that, due to the 

redundancy in the genetic code, individual nucleotides can be readily exchanged in a 
p codon, and still result in an identical amino acid sequence. The terms "protein," 

JS; "peptide" and "polypeptide" are used interchangeably herein, 
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fij Brief Descriptioii of the Figures 

Figure 1 is a photograph of an electrophoresis gel showing the partial length 
canine PepTl cDNA reaction products generated by reverse transcription- 
polymerase chain reaction (RT-PCR) methodology. Partial length canine PepTl 

25 (cPepTl, about 783 bp) cDNAs were generated by reverse transcription-polymerase 
chain reaction (RT-PCR) methodology. RT-PCR reaction products were generated 
using mRNA isolated from canine jejunal epitheUum and two different PCR primer 
sets. Gel contents are as follows: lane 1, 1 Kb molecular weight DNA ladder; lane 
2, negative control PCR reaction (lacks Taq polymerase); lane 3, PCR reaction 

30 products using primer set 4 (corresponding to base pairs 83 to 863 of rabbit PepTl); 



lane 4, PGR reaction products using primer set 10-^780 bp cDNA product using 
primer set 10 (corresponding to base pairs 85 to 861 of rabbit PepTl). Note the 
reaction products in lanes 3 and 4 of about 780 base pairs. 

Figures 2A and 2B are photographs of agarose gels showing the 
5 representative results of restriction analyses of pCR® II/cPepT 1 plasmids generated 
by TA-cloning of primer set 4-derived RT-PCR cDNA. Restriction analyses of 
pCR®II/cPepTl plasmids generated by TA-cloning of primer set 4-derived RT-PCR 
cDNA are shown in these figures. Data are representative of four cDNA-containing 
plasmids from a total of fifty-six "positive" bacterial colonies selected by blue/white 
10 screening. TA-clones were amphfied, pCR®II/cDNA vectors isolated, and Xho I 
and Kpn I endonucleases restriction products size-separated through 1 .2% agarose 
gels. Figure 2A is a photograph of an agarose gel showing representative results of 
Jf the PCR-based analyses of TA-clone 26. In particular, analyses of pCR®II/cPepTl- 

O 26 (TA-clone 26) are shown; lane 1, 1 DNA size standard; lane 2, minus 

^ 1 5 endonuclease-restriction control (uncut pCR®n plasmid); lane 3, positive restriction 

control (Xho I restriction of empty pCR®n vector); lane 4, uncut pCR®II/cPepTl-26 
fll (Clone 26); lane 5, Xho I- and Kpn I-restricted Clone 26. Note that empty pCR®II 

vector is 3.9 kb in size and that lane 5 contains a product of about 780 bp. 
Figure 2B is a photograph of an agarose gel showing representative results of the 
20 PCR-based analyses of TA-clone 4 and 6. In particular, analyses of TA-clone 4 and 
6 are shown; lane 1, DNA size stand^d; lane 2, uncut pCR®II/cPepTl-4 (TA-clone 
4); lane 3, Xho I- and Kpn I-restricted pCR®II/cPepTl-4; lane 4, uncut 
pCR®II/cPepTl-6 (TA-clone 6); lane 5, Xho I- and Kpn I-restricted pCR®II/cPepTl- 
6. Note that empty pCR®II vector is 3.9 kb in size and that lane 3 does not contain a 
25 product of about 780 bp, whereas lane 5 does. 

Figures 3 A and 3B are photographs showing the representative results of 
Northern blot identification of cPepTl mRNA expression by canine tissues and 
MDCK cells using canine intestinal epitheliimi-derived RT-PCR cDNA. 
Arrangement of RNA isolated from tissue or cell homogenates on both blots is as 
30 follows: lane 1, kidney (animal #103 1 A); lane 2, kidney (animal K-9-1); lane 3, 
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MDCK cells; lane 4, jejunal epithelium (animal K-9-4). Figure 3A is a photograph 
showing the Northern blot identification of ATlNA (3 ng/lane) that was hybridized 
with [^^P]-cPepTl-26 cDNA. Figure 3B is a photograph showing the Northern blot 
identification of total RNA (20 |xg/lane) that was hybridized with [^^P]-cPepTl-6 
5 cDNA. 

Figure 4 is a photograph showing the representative results of Northern blot 
identification of cPepTl mRNA expression in catune tissues using full-length rabbit 
PepTl cDNA. Ten |ig total RNA (lane 1) or 6 |ag AllNA (lanes 2 to 5) were 
isolated from liver and kidney tissues from three animals. Latie 1, liver (animal 
10 #1042A); lane 2, liver (animal #1008A); lane 3, kidney (animal #1008A); lane 4, 
liver (animal #1031A); lane 5, kidney (animal #1031 A). 

Figure 5 is the partial-length nucleic acid sequence of canine PepTl cDNA 
Q of tiie present invention that was cloned from MDCK cells (SEQ ID N0:9). The 

^ 381 base pairs of TA clone PepTl-6R-20 shares 79% homology to base pairs 259 to 

15 640 of rabbit PepTl (GenBank acc no. 473375). 

Figure 6 is a graph illustrating the influence of extracellular GlySar 
[ ^ concentrations on GlySar uptake by confluent MDCK cells in pH 6.0 media. By 

Q graphical evaluation, an appa-ent K„ of about 4 mM was demonstrated. Each data 

point is the mean of 5 to 6 observations and all coefficients of variation were less 
20 than 15%. 

Figure 7 is a graph illustrating the protein content of MDCK cells cultured 
in DMEM or LHM. Values are the means ± SD of protein content of wells (n = 12) 
of MDCK cells after seeding at 60,000 or 120,000 cells/well, culture for 1 d in 
DMEM, and then culture in DMEM or LHM for 1, 2, 3, or 5 d (Days 2, 3, 4 and 6, 
25 respectively). Protein content was determined by the method of Lowry, using 
bovine serum albumin as the standard. 

Figure 8 is a graph illustrating GlySar (2.88 \xM) uptake in pH 6.0 or pH 7.4 
buffer by MDCK cells cultured in DMEM or LHM. Uptake was measured in the 
absence (pH 7.4) or presence (pH 6.0) of an extracellular-to-intracellular 
30 gradient. 
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Figure 9 is a graph illustrating H^-dependent [^H]-GlySar (2.88 |aM) uptake 
by MDCK cells cultured in DMEM or LHM. Values were calculated as the 
difference in GlySar uptake in the presence (pH 6.0 uptake buffer) and absence (pH 
7.4 uptake buffer) of an extracellular-to-intercellular proton gradient. 

Figure 10 is a graph illustrating pH-dependent GlySar uptake by MDCK 
cells seeded at 60,000 cells/well and cultured in LHM for 2 days. pH-dependent 
GlySar (2.88 uM) uptake by MDCK cells cultured with standard conditions. Values 
represent the H"'-dependent GlySar uptake means ± SD of wells (n = 16) of MDCK 
cells, calculated as the difference from GlySar uptake in the presence of pH 6.0 or 
7.4 buffers. 

Figure 11 is a graph illustrating the effect of time on GlySar uptake 
(100|LiM) by MDCK cells. By-minute time course for GlySar (uptake by MDCK 
cells cultured with standard conditions. Mean ± SD GlySar uptake wells of cells (n 
= 6) were assayed at 3.75, 7.5, 15, 30, 60, or 120 min. 

Figure 12 is a graph illustratmg the effect of GlySar concentration on 
MDCK cells seeded at 60K/well grown in LHM. The graph indicates the K^ 
characterization (1 .0 mM) of H'^-dependent GlySar uptake by MDCK cells. Each 
value represents the mean ± SD uptake of GlySar by wells (n = 8) of MDCK cells 
cultured using standard conditions. 

Figure 13 is a graph illustrating the inhibition of peptide uptake by MDCK 
cells with antibiotics. The mean ± SD are the uptake of GlySar by wells (n = 5-8) of 
MDCK cells in the absence or presence of GlySar (1 mM) Penicillin-G (3 mM), 
cefadroxil (30 |liM), or cefadroxil (3 mM). 

Figure 14 is a graph illustrating the inhibition of peptide uptake by MDCK 
cells with Gly-containing peptides. The mean ± SD uptake of GlySar by wells (n = 
7-8) of MDCK cells in the absence or presence of indicated competitor substrates (1 
mM). 

Figure 15 is a graph illustratmg the inhibition of 100 |iM GlySar uptake by 
ImM TrpLeu, LeuTrp, Leu, or Trp in the absence (pH 7.5) and presence (no pH 
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designation) of a proton gradient and 1 mM of indicated substrates. Values are the 
mean ± SD uptake of GlySar by wells (n = 7-8) of MDCK cells. 

Figure 16 is a graph illustrating the inhibition of 100 jiM GlySar uptake by 
MDCK cells in the absence (pH 7.5) and presence (no pH designation) of a proton 
gradient and 1 mM of Trp-containing peptides. Values are the mean ± SD uptake of 
GlySar by wells (n = 7-8) of MDCK cells. 

Figure 17 is a graph illustrating the inhibition of 100 |aM GlySar uptake by 
MDCK cells in the absence (pH 7.5) and presence (no pH designation) of a proton 
gradient and 100 jaM of Trp-containing peptides. Values are the mean ± SD uptake 
of GlySar by wells (n - 8) of MDCK cells. 

Figure 18 is a graph illustrating the IC50 inhibition of H^-dependent GlySar 
uptake by TrpLeu and TrpTrp. K^ values were determined for inhibition of ff- 
dependent 100 \iM GlySar uptake by MDCK cells in the presence of 0, 0.025, 0.1, 
0.4, or 1 .6 mM TrpTrp or TrpLeu. Values are the mean ± SD uptake of GlySar by 
wells (n = 6-8) of MDCK cells. 

Figure 19 is a graph illustrating substrate (lOmM) regulation of protein 
content of MDCK cells cultured in DMEM. In particular, the influence of 10 mM 
camosine, glycylphenylalanine (GlyPhe), Phe, or Gly supplementation of DMEM 
on protein content of MDCK cells was measured. 

Figure 20 is a graph illustrating substrate (lOmM) regulation of GlySar 
uptake by MDCK cells cultured in DMEM. In particular, the influence of 10 mM 
camosine, glycylphenylalanine (GlyPhe), Phe, or Gly supplementation of DMEM 
on H^-dependent uptake of [^H]Glycylsarcosine (GlySar) by MDCK cells was 
measured. 

Figure 21 is a graph illustrating substrate (lOmM) regulation of protein 
content of MDCK cells cultured in DMEM. In particular, the influence of 10 mM 
glycylsarcosine (GlySar), glycylproline (GlyPro), glycylphenylalanine (GlyPhe), or 
camosine of DMEM on protein content of MDCK cells was measured. 

Figure 22 is a graph illustrating substrate (lOmM) regulation of GlySar 
uptake by MDCK cells cultured in DMEM. In particular, the influence of 10 mM 
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glycylsarcosine (GlySar), glycylproline (GlyPro), glycylphenylalanine (GlyPhe), or 
camosine on H'^-dependent uptake of pH]Glycylsarcosine (GlySar) by MDCK cells 
was measured. 

Figure 23 is a graph illustrating the influence of DMEM, nutrient depleted, 
dexamethasone (Dex), or insulin (ins) on H'"-dependent uptake of 
['HJGlycylsarcosine (GlySar) by MDCK cells. 

Figure 24 is a graph illustrating influence of IGF-I on H''-dependent uptake 
of ['H]Glycylsarcosine (GlySar) by MDCK cells. 

Definitions 

The term "gene" is used broadly to refer to any segment of nucleic acid 
associated with a biological function. Thus, genes include coding sequences and/or 
the regulatory sequences required for their expression. For example, gene refers to a 
nucleic acid fragment that expresses mRNA, functional RNA, or specific protein, 
including regulatory sequences. Genes also include nonexpressed DNA segments 
that, for example, form recognition sequences for other proteins. Genes can be 
obtained from a variety of sources, including cloning from a source of interest or 
synthesizing from known or predicted sequence information, and may include 
sequences designed to have desired parameters. 

The term "native gene" refers to gene that is present in the genome of an 
imtransformed cell. 

"Naturally occurring" is used to describe an object that can be found in 
nature as distinct from bemg artificially produced by man. For example, a protein or 
nucleotide sequence present in an organism (including a virus), which can be 
isolated from a source in nature and which has not been intentionally modified by 
man in the laboratory, is naturally occurring. 

A "marker gene" encodes a selectable or screenable trait. 

The term "chimeric gene" refers to any gene that contains 1) DNA 
sequences, including regulatory and coding sequences, that are not found together in 
nature, or 2) sequences encoding parts of proteins not naturally adjoined, or 3) parts 
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of promoters that are not naturally adjoined. Accordingly, a chimeric gene may 
comprise regulatory sequences and coding sequences that are derived from different 
sources, or comprise regulatory sequences and coding sequences derived from the 
same source, but arranged in a manner different from that found in nature. 

A "transgene" refers to a gene that has been introduced into the genome by 
transformation and is stably maintained. Transgenes may include, for example, 
genes that are either heterologous or homologous to the genes of a particular cell to 
be transformed. Additionally, transgenes may comprise native genes inserted into a 
non-native orgmiism, or chimeric genes. The term "endogenous gene" refers to a 
native gene in its natural location in the genome of an organism. A "foreign" gene 
refers to a gene not normally found in the host organism but that is introduced by 
gene transfer. 

The terms "protein," "peptide" and "polypeptide" are used interchangeably 

herein. 

Expression cassettes will comprise the transcriptional initiation region of the 
invention linked to a nucleotide sequence of interest. Such an expression cassette is 
provided with a plurality of restriction sites for msertion of the gene of interest to be 
under the transcriptional regulation of the regulatory regions. The expression 
cassette may additionally contain selectable marker genes. 

The transcriptional cassette will include in the 5'-3' direction of transcription, 
a transcriptional and translational initiation region, a DNA sequence of interest, and 
a transcriptional and translational termination region. The termination region may 
be native with the transcriptional initiation region, may be native with the DNA 
sequence of interest, or may be derived from another source. 

An oUgonucleotide for use in probing or amphfication reactions may be 
about 30 or fewer nucleotides in length (e.g., 9, 12, 15, 18, 20, 21 or 24, or any 
number between 9 and 30). Generally specific primers are upwards of 14 
nucleotides in length. For optimum specificity and cost effectiveness, primers of 
16-24 nucleotides in length may be preferred. Those skilled in the art are well 
versed in the design of primers for use processes such as PGR. If required, probing 
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can be done with entire restriction fragments of the gene disclosed herein which may 
be lOO's or even lOOO's of nucleotides in length. 

"Coding sequence" refers to a DNA or RNA sequence that codes for a 
specific amino acid sequence and excludes the non-coding sequences. It may 
constitute an "uninterrupted coding sequence", i.e., lacking an intron, such as in a 
cDNA or it may include one or more introns bounded by appropriate splice 
junctions. An "intron" is a sequence of RNA which is contained in the primary 
transcript but which is removed through cleavage and re-Ugation of the RNA within 
the cell to create the mature mRNA that can be translated into a protein. 

The terms "open reading frame" and "ORF" refer to the amino acid sequence 
encoded between translation initiation and termination codons of a coding sequence. 
The terms "initiation codon" and "termination codon" refer to a unit of three 
adjacent nucleotides ('codon') in a coding sequence that specifies initiation and chain 
termination, respectively, of protein synthesis (mRNA translation), 

A "fimctional RNA" refers to an antisense RNA, ribozyme, or other RNA 
that is not translated. 

The term "RNA transcript" refers to the product resulting from RNA 
polymerase catalyzed transcription of a DNA sequence. When the RNA transcript is 
a perfect complementary copy of the DNA sequence, it is referred to as the primary 
transcript or it may be a RNA sequence derived from post-transcriptional processing 
of the primary transcript and is referred to as the mature RNA. "Messenger RNA" 
(mRNA) refers to the RNA that is without introns and that can be translated into 
protein by the cell. "cDNA" refers to a single- or a double-stranded DNA that is 
complementary to and derived from mRNA. 

"Regulatory sequences" and "suitable regulatory sequences" each refer to 
nucleotide sequences located upstream (5' non-coding sequences), within, or 
downstream (3' non-coding sequences) of a coding sequence, and which influence 
the transcription, RNA processing or stability, or translation of the associated coding 
sequence. Regulatory sequences include enhancers, promoters, translation leader 
sequences, introns, and polyadenylation signal sequences. They include natural and 
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synthetic sequences as well as sequences which may be a combination of synthetic 
and natural sequences. As is noted above, the term "suitable regulatory sequences" 
is not limited to promoters. 

"5' non-coding sequence" refers to a nucleotide sequence located 5' 
5 (upstream) to the coding sequence. It is present in the fiiUy processed mRNA 

upstream of the initiation codon and may affect processing of the primary transcript 
to mRNA, mRNA stability or translation efficiency. (Turner et al., Molecular 
Biotechnology. 3:225 (1995)). 

"3' non-coding sequence" refers to nucleotide sequences located 3* 
10 (downstream) to a coding sequence and include polyadenylation signal sequences 
and other sequences encoding regulatory signals capable of affecting mRNA 
processing or gene expression. The polyadenylation signal is usually characterized 
by affecting the addition of polyadenyUc acid tracts to the 3' end of the mRNA 
precursor. 

1 5 The term "translation leader sequence" refers to that DNA sequence portion 

of a gene between the promoter and coding sequence that is transcribed into RNA 
and is present in the fully processed mRNA upstream (50 of the translation start 
codon. The translation leader sequence may affect processing of the primary 
transcript to mRNA, mRNA stability or translation efficiency. 

20 The term "mature" protein refers to a post-translationally processed 

polypeptide without its signal peptide. "Precursor" protein refers to the primary 
product oftranslation of an mRNA. "Signal peptide" refers to the mnino terminal 
extension of a polypeptide, which is translated in conjunction with the polypeptide 
forming a precursor peptide and which is required for its entrance into the secretory 

25 pathway. The term "signal sequence" refers to a nucleotide sequence that encodes 
the signal peptide. 

The term "intracellular localization sequence" refers to a nucleotide sequence 
that encodes an intracellular targeting signal. An "intracellular targeting signal" is 
an amino acid sequence that is translated in conjunction with a protein and directs it 
30 to a particular sub-cellular compartment. "Endoplasmic reticulum (ER) stop transit 
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signal" refers to a carboxy-terminal extension of a polypeptide, which is translated 
in conjunction with the polypeptide and causes a protein that enters the secretory 
pathway to be retained in the ER. "ER stop transit sequence" refers to a nucleotide 
sequence that encodes the ER targeting signal. 
5 "Promoter" refers to a nucleotide sequence, usually upstream (5') to its 

coding sequence, that controls the expression of the coding sequence by providing 
the recognition for RNA polymerase and other factors required for proper 
transcription. "Promoter" includes a minimal promoter that is a short DNA sequence 
comprised of a TATA- box and other sequences that serve to specify the site of 
10 transcription initiation, to which regulatory elements are added for control of 

expression. "Promoter" also refers to a nucleotide sequence that includes a minimal 
promoter plus regulatory elements that is capable of controlling the expression of a 
coding sequence or functional RNA. This type of promoter sequence consists of 
9 proximal md more distal upstream elements, the latter elements often referred to as 

□ 15 enhancers. Accordingly, an "enhancer" is a DNA sequence which can stimulate 
% promoter activity and may be an innate element of the promoter or a heterologous 
i U element inserted to enhance the level or tissue specificity of a promoter. It is 

□ capable of operating in both orientations (normal or flipped), and is capable of 

:i{ functioning even when moved either upstream or downstream from the promoter. 

20 Both enhancers and other upstream promoter elements bind sequence-specific DNA- 
7| binding proteins that mediate their effects. Promoters may be derived in their 

entirety jfrom a native gene, or be composed of different elements derived from 
different promoters found in nature, or even be comprised of synthetic DNA 
segments. A promoter may also contain DNA sequences that are involved in the 
25 binding of protein factors which control the effectiveness of transcription initiation 
in response to physiological or developmental conditions. 

The "initiation site" is the position surrounding the first nucleotide that is 
part of the transcribed sequence, which is also defined as position +1. With respect 
to this site all other sequences of the gene and its controlling regions are numbered. 
30 Downstream sequences (i.e. further protein encoding sequences in the 3' direction) 
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are denominated positive, while upstream sequences (mostly of the controlling 
regions in the 5' direction) are denominated negative. 

Promoter elements, particul^ly a TATA element, that are inactive or that 
have greatly reduced promoter activity in the absence of upstream activation are 
referred to as "minimal or core promoters." In the presence of a suitable 
transcription factor, the minimal promoter functions to permit transcription. A 
"minimal or core promoter" thus consists only of all basal elements needed for 
transcription initiation, e.g., a TATA box and/or an initiator. 

"Inducible promoter" refers to those regulated promoters that can be turned 
on in one or more cell types by an external stimulus, such as a chemical, Ught, 
hormone, stress, or a pathogen. 

"Operably-hnked" refers to the association of nucleic acid sequences on 
single nucleic acid fragment so that the function of one is affected by the other. For 
example, a regulatory DNA sequence is said to be "operably linked to" or 
"associated with" a DNA sequence that codes for an RNA or a polypeptide if the 
two sequences are situated such that the regulatory DNA sequence affects 
expression of the coding DNA sequence (i.e., that the coding sequence or functional 
RNA is under the transcriptional control of the promoter). Coding sequences can be 
operably-linked to regulatory sequences in sense or antisense orientation. 

"Expression" refers to the transcription and/or translation of an endogenous 
gene or a transgene in cells. For example, in the case of antisense constructs, 
expression may refer to the transcription of the antisense DNA only. In addition, 
expression refers to the transcription md stable accumulation of sense (mRNA) or 
functional RNA. Expression may also refer to the production of protein. 

The analysis of transcription start points in practically all promoters has 
revealed that there is usually no single base at which transcription starts, but rather a 
more or less clustered set of initiation sites, each of which accounts for some start 
points of the mRNA. Since this distribution varies from promoter to promoter the 
sequences of the reporter mRNA in each of the populations would differ from each 
other. Since each mRNA species is more or less prone to degradation, no single 
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degradation rate can be expected for different reporter mRNAs. It has been shown 
for various eukaryotic promoter sequences that the sequence surrounding the 
initiation site ('initiator') plays an important role in determining the level of RNA 
expression directed by that specific promoter. This includes also part of the 
transcribed sequences. The direct fusion of promoter to reporter sequences would 
therefore lead to much suboptimal levels of transcription. 

A commonly used procedure to analyze expression patterns and levels is 
through determination of the 'steady state' level of protein accumulation in a cell. 
Commonly used candidates for the reporter gene, known to those skilled in the art 
are 9-glucuronidase (GUS), growth hormone (GH), Chloramphenicol Acetyl 
Transferase (CAT) and proteins with fluorescent properties, such as Green 
Fluorescent Protein (GFP) from Aequora victoria. In principle, however, many 
more proteins are suitable for this purpose, provided the protein does not interfere 
with essential cell functions. For quantification and determination of locahzation a 
number of tools are suited. Detection systems can readily be created or are available 
which are based on e.g. immunochemical, enzymatic, fluorescent detection and 
quantification. Protein levels can be determined in cell extracts or in intact tissue 
using in situ analysis of protein expression. 

Generally, individual transformed lines with one chimeric promoter reporter 
construct will vary in their levels of expression of the reporter gene. Also frequently 
observed is the phenomenon that such transformants do not express any detectable 
product (RNA or protein). The variability in expression is commonly ascribed to 
'position effects, although the molecular mechanisms underlying this inactivity are 
usually not clear. 

"Non-specific expression" refers to constitutive expression or low level, 
basal ('leaky') expression in nondesired cells or tissues from a 'regulated promoter'. 

"Antisense inhibition" refers to the production of antisense RNA transcripts 
capable of suppressing the expression of protein from an endogenous gene or a 
transgene. 
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"Co-suppression" and "transwitch" each refer to the production of sense 
RNA transcripts capable of suppressing the expression of identical or substantially 
similar transgene or endogenous genes (U.S. Patent No. 5,231,020). 

"Homologous to" refers to the similarity between the nucleotide sequence of 
two nucleic acid molecules or between the amino acid sequences of two protein 
molecules. Estimates of such homology are provided by either DNA-DNA or 
DNA-RNA hybridization under conditions of stringency as is well understood by 
those skilled in the art (as described in Haines and Higgins (eds,), Nucleic Acid 
Hybridization, IRL Press, Oxford, U.K.), or by the comparison of sequence 
similarity between two nucleic acids or proteins. 

The term "substantially similar" refers to nucleotide and amino acid 
sequences that represent equivalents of the instant inventive sequences. For 
example, altered nucleotide sequences which simply reflect the degeneracy of the 
genetic code but nonetheless encode amino acid sequences that are identical to the 
inventive amino acid sequences are substantially similar to the inventive sequences. 
In addition, amino acid sequences that are substantially similar to the instant 
sequences are those wherein overall amino acid identity is 95% or greater to the 
instant sequences. Modifications to the instant invention that result in equivalent 
nucleotide or amino acid sequences is well within the routine skill in the art. 
Moreover, the skilled artisan recognizes that equivalent nucleotide sequences 
encompassed by this invention can also be defmed by their ability to hybridize, 
under stringent conditions (O.IX SSC, 0.1% SDS, 65°C), with the nucleotide 
sequences that are within the literal scope of the instant claims. 

"Transgene activation system" refers to the expression system comprised of 
an inactive transgene and a chimeric site-specific recombinase gene, functioning 
together, to effect transgene expression in a regulated manner. The specificity of the 
recombination will be determined by the specificity of regulated promoters as well 
as the use of wild-type or mutant site-specific sequences. Both elements of the 
system can be chromosomally integrated and inherited independently. 
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"Target gene" refers to a gene on the replicon that expresses the desired 
target coding sequence, functional RNA, or protein. The target gene is not essential 
for repUcon replication. Additionally, target genes may comprise native non-viral 
genes inserted into a non-native organism, or chimeric genes, and will be under the 
5 control of suitable regulatory sequences. Thus, the regulatory sequences in the 
target gene may come from any source, including the virus. 

"Transcription Stop Fragment" refers to nucleotide sequences that contain 
one or more regulatory signals, such as polyadenylation signal sequences, capable of 
terminating transcription. Examples include the 3' non-regulatory regions of genes 
10 encoding nopaline synthase and the small subunit of ribulose bisphosphate 
carboxylase. 

"Translation Stop Fragment" refers to nucleotide sequences that contain one 
or more regulatory signals, such as one or more termination codons in all three 
frames, capable of terminating translation. Insertion of a translation stop fragment 
1 5 adjacent to or near the initiation codon at the 5* end of the coding sequence will 
result in no translation or improper translation. Excision of the translation stop 
fragment by site-specific recombination will leave a site-specific sequence in the 
coding sequence that does not interfere with proper translation using the initiation 
codon. 

20 "Blocking fragment" refers to a DNA fragment that is flanked by site 

specific sequences that can block the transcription and/or the proper translation of a 
coding sequence resulting in an mactive transgene. When the blocking fragment 
contains polyadenylation signal sequences and other sequences encoding regulatory 
signals capable of terminating transcription, it can block the transcription of a 

25 coding sequence when placed in the 5' non-translated region, i.e., between the 
transcription start site and the ORF. When inserted in the coding sequence a 
blocking fragment can block proper translation by disrupting its open reading frame. 
DNA rearrangement by site-specific recombination can restore transcription and/or 
proper translatabihty. For example, excision of the blocking fragment by site- 

30 specific recombination leaves behind a site-specific sequence that allows 

20 



transcription and/or proper translatability. A Transcription or Translational Stop 
Fragment will be considered a blocking fragment. 

The terms ''in cis'' and "zw trans'' refer to the presence of DNA elements, 
such as the viral origin of replication and the replication protein(s) gene, on the 
same DNA molecule or on a different DNA molecule, respectively. 

The terms "c/5-acting sequence" and "c/^-acting element" refer to DNA or 
RNA sequences whose functions require them to be on the same molecule. An 
example of a ci5-acting sequence on the repUcon is the viral replication origin. 

The terms '7ra«5-acting sequence" and '7/-aw5-acting element" refer to DNA 
or RNA sequences whose function does not require them to be on the same 
molecule. 

"Cz5-acting viral sequences" refers to viral sequences necessary for viral 
replication (such as the replication origin) and in cis orientation. 

"Transactivating gene" refers to a gene encoding a transactivating protein. It 
can encode a viral repHcation protein(s) or a site-specific repHcase. It can be a 
natural gene, for example, a viral repUcation gene, or a chimeric gene, for example, 
when regulatory sequences are operably-linked to the open reading frame of a site- 
specific recombinase or a viral repHcation protein. "Transactivating genes" may be 
chromosomally integrated or transiently expressed. 

"Wild-type" refers to the normal gene, virus, or organism foimd in nature 
without any known mutation. 

"Genome" refers to the complete genetic material of an organism. 
The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and 
polymers thereof in either single- or double-stranded form, composed of monomers 
(nucleotides) containing a sugar, phosphate and a base which is either a purine or 
pyrimidine. Unless specifically limited, the term encompasses nucleic acids 
containing known analogs of natural nucleotides which have similar binding 
properties as the reference nucleic acid and are metabolized in a manner similar to 
naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic 
acid sequence also impUcitly encompasses conservatively modified variants thereof 
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(e.g., degenerate codon substitutions) and complementary sequences as well as the 
sequence explicitly indicated. Specifically, degenerate codon substitutions may be 
achieved by generating sequences in which the third position of one or more 
selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues 
5 (Batzer et al., Nucleic Acid Res. , 19, 5081 (1991); Ohtsuka et al., J. Biol. Chem., 
260, 2605 (1985); Rossolini et al., Mol. Cell. Probes , 8, 91 (1994)). A "nucleic acid 
fragment" is a fraction of a given nucleic acid molecule. In higher animals, 
deoxyribonucleic acid (DNA) is the genetic material while ribonucleic acid (RNA) 
is involved in the fransfer of mformation contained within DNA into proteins. A 

1 0 "genome" is the entire body of genetic material contained in each cell of an 

organism. The term "nucleotide sequence" refers to a polymer of DNA or RNA 
which can be single- or double-sfranded, optionally containing synthetic, non- 
natural or altered nucleotide bases capable of incorporation into DNA or RNA 
polymers. The terms "nucleic acid" or "nucleic acid sequence" may also be used 

1 5 interchangeably witii gene, cDNA, DNA and RNA encoded by a gene. 

The invention encompasses isolated or substantially purified nucleic acid or 
protein compositions. In the context of the present invention, an "isolated" or 
"purified" DNA molecule or an "isolated" or "purified" polypeptide is a DNA 
molecule or polypeptide that, by the hand of man, exists apart from its native 

20 environment and is therefore not a product of nature. An isolated DNA molecule or 
polypeptide may exist in a purified form or may exist in a non-native environment 
such as, for example, a fransgenic host cell. For example, an "isolated" or "purified" 
nucleic acid molecule or protein, or biologically active portion thereof, is 
substantially free of other cellular material, or culture medium when produced by 

25 recombinant techniques, or substantially free of chemical precursors or other 

chemicals when chemically synthesized. Preferably, an "isolated" nucleic acid is 
free of sequences (preferably protein encoding sequences) that naturally flank the 
nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid) in the 
genomic DNA of the organism from which the nucleic acid is derived. For 

30 example, in various embodunents, the isolated nucleic acid molecule can contain 
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less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequences 
that naturally flank the nucleic acid molecule in genomic DNA of the cell from 
which the nucleic acid is derived. A protein that is substantially free of cellular 
material includes preparations of protein or polypeptide having less than about 30%, 

5 20%, 10%, 5%, (by dry weight) of contaminating protein. When the protein of the 
invention, or biologically active portion thereof, is recombinantly produced, 
preferably culture medium represents less than about 30%, 20%, 10%, or 5% (by 
dry weight) of chemical precursors or non-protein-of- mterest chemicals. Fragments 
and variants of the disclosed nucleotide sequences and proteins or partial-length 

1 0 proteins encoded thereby are also encompassed by the present invention. By 
"fragment" is intended a portion of the nucleotide sequence or a portion of the 
amino acid sequence, and hence a portion of the polypeptide or protein, encoded 
thereby. Alternatively, fragments of a nucleotide sequence that are usefril as 
hybridization probes generally do not encode fragment proteins retaining biological 

1 5 activity. Thus, fragments of a nucleotide sequence may range from at least about 9 
nucleotides, about 12 nucleotides, about 20 nucleotides, about 50 nucleotides, about 
100 nucleotides or more. 

By "variants" is intended substantially similar sequences. For nucleotide 
sequences, variants include those sequences that, because of the degeneracy of the 

20 genetic code, encode the identical amino acid sequence of the native protein. 

Naturally occurring alleUc variants such as these can be identified with the use of 
well-known molecular biology techniques, as, for example, with polymerase chain 
reaction (PGR) and hybridization techniques. Variant nucleotide sequences also 
include synthetically derived nucleotide sequences, such as those generated, for 

25 example, by using site-directed mutagenesis which encode the native protein, as 
well as those that encode a polypeptide having ammo acid substitutions. Generally, 
nucleotide sequence variants of the invention will have at least 40, 50, 60, to 70%, 
e.g., preferably 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, to 79%, generally at 
least 80%, e.g., 81%-84%, at least 85%, e.g., 86%, 87%, 88%, 89%, 90%, 91%, 
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92%, 93%, 94%, 95%, 96%, 97%, to 98% sequence identity to the native nucleotide 
sequence. 

By "variant" polypeptide is intended a polypeptide derived from the native 
protein by deletion (so-called truncation) or addition of one or more amino acids to 
the N-terminal and/or C-terminal end of the native protein; deletion or addition of 
one or more amino acids at one or more sites in the native protein; or substitution of 
one or more amino acids at one or more sites in the native protein. Such variants 
may results form, for example, genetic polymorphism or from human manipulation. 
Methods for such manipulations are generally known in the art. 

Thus, the polypeptides of the invention may be altered in various ways 
including amino acid substitutions, deletions, truncations, and insertions. Methods 
for such manipulations are generally known in the art. For example, amino acid 
sequence variants of the polypeptides can be prepared by mutations in the DNA. 
Methods for mutagenesis and nucleotide sequence alterations are well known in the 
art. See, for example, Kunkel, Proc. Natl. Acad. Sci. USA , 82, 488 (1985); Kunkel 
et al., Methods in Enzvmol., 154, 367 (1987); U. S. Patent No. 4,873,192; Walker 
and Gaastra, eds., Techniques in Molecular Biology, MacMillan PubUshing 
Company, New York (1983) and the references cited therein. Guidance as to 
appropriate amino acid substitutions that do not affect biological activity of the 
protein of interest may be found in the model of Dayhoff et al.. Atlas of Protein 
Sequence and Structure, Natl. Biomed. Res. Found., Washington, CD. (1978), 
herein incorporated by reference. Conservative substitutions, such as exchanging 
one amino acid with another having similar properties, are preferred. 

Thus, the genes and nucleotide sequences of the invention include both the 
naturally occurring sequences as well as mutant forms. Likewise, the polypeptides 
of the invention encompass both naturally occurring proteins as well as variations 
and modified forms thereof Such variants will continue to possess the desired 
activity. The deletions, insertions, and substitutions of the polypeptide sequence 
encompassed herein are not expected to produce radical changes in the 
characteristics of the polypeptide. However, when it is difficult to predict the exact 
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effect of the substitution, deletion, or insertion in advance of doing so, one skilled in 
the art will appreciate that the effect will be evaluated by routine screening assays. 

"Expression cassette" as used herein means a DNA sequence capable of 
directing expression of a particular nucleotide sequence in an appropriate host cell, 
comprising a promoter operably linked to the nucleotide sequence of interest which 
is operably linked to termination signals. It also typically comprises sequences 
required for proper translation of the nucleotide sequence. The coding region 
usually codes for a protein of interest but may also code for a functional RNA of 
interest, for example antisense RNA or a nontranslated RNA, in the sense or 
antisense direction. The expression cassette comprising the nucleotide sequence of 
interest may be chimeric, meaning that at least one of its components is 
heterologous with respect to at least one of its other components. The expression 
cassette may also be one which is naturally occurring but has been obtained in a 
recombinant fonn useful for heterologous expression. The expression of the 
nucleotide sequence in the expression cassette may be under the control of a 
constitutive promoter or of an inducible promoter which initiates transcription only 
when the host cell is exposed to some particular external stimulus. In the case of a 
multicellular organism, the promoter can also be specific to a particular tissue or 
organ or stage of development. 

The proteins of the invention may be altered in various ways including 
amino acid substitutions, deletions, truncations, and insertions. Methods for such 
manipulations are generally known in the art. Methods for mutagenesis and 
nucleotide sequence alterations are well known in the art. See, for example, Kunkel, 
Proc. Natl. Acad. Sci. USA , 82:488-492 (1985); Kunkel et al., Methods in Enzymol. 
154:367-382 (1987); US Patent No. 4,873,192; Walker and Gaastra, eds. (1983) 
Techniques in Molecular Biology (MacMillan Pubhshing Company, New York) and 
the references cited thereui. Guidance as to appropriate amino acid substitutions 
that do not affect biological activity of the protein of interest may be found in the 
model of Dayhoff et al. (1978) Atlas of Prote in Sequence and Structure (Natl. 
Biomed. Res. Found., Washington, D.C.), herein incorporated by reference. 
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Conservative substitutions, such as exchanging one amino acid with another having 
similar properties, may be preferred. 

Thus, the genes and nucleotide sequences of the invention include both the 
naturally occurring sequences as well as mutant forms. Likewise, the proteins of the 
5 invention encompass both naturally occurring proteins as well as variations and 
modified forms thereof Such variants will continue to possess the desired disease 
resistance activity. Obviously, the mutations that will be made in the DNA 
encoding the variant must not place the sequence out of reading frame and 
preferably will not create complementary regions that could produce secondary 
10 mRNA structure. See, EP Patent Application Publication No. 75,444. 

The deletions, insertions, and substitutions of the protein sequence 
encompassed herein are not expected to produce radical changes in the 
characteristics of the protein. However, when it is difficult to predict the exact 
effect of tiie substitution, deletion, or insertion in advance of doing so, one skilled in 
15 the art will appreciate that the effect will be evaluated by routine screening assays. 
Hybridization of such sequences may be carried out under stringent conditions. 
"Stringent hybridization conditions" and "stringent hybridization wash 
O conditions" in the context of nucleic acid hybridization experiments such as 

2 . ii 

S Southern and Northern hybridization are sequence dependent, and are different 

20 under different environmental parameters. Longer sequences hybridize specifically 
O . . . 

P4 at higher temperatures. An extensive guide to the hybridization of nucleic acids is 

found in Tijssen, Laboratorv Techniques in Biochemistr v and Molecular biology- 

Hvbridization with Nucleic Acid Probes, page 1, chapter 2 "Overview of principles 

of hybridization and the strategy of nucleic acid probe assays" Elsevier, New York 

25 (1993). Generally, highly stringent hybridization and wash conditions are selected 
to be about 5" C lower than the thermal meltmg point (T J for the specific sequence 
at a defined ionic strength and pH. Typically, under "stringent conditions" a probe 
will hybridize to its target subsequence, but to no other sequences. For example, by 
"stringent conditions" or "stringent hybridization conditions" is intended conditions 

30 under which a probe will hybridize to its target sequence to a detectably greater 
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degree than to other sequences (e.g., at least 2- fold over backgroimd). By 
controlling the stringency of the hybridization and/or washing conditions, target 
sequences that are 100% complementary to the probe can be identified (homologous 
probing). Alternatively, stringency conditions can be adjusted to allow some 
5 mismatching in sequences so that lower degrees of similarity are detected 

(heterologous probing). Generally, a probe is less than about 1000 nucleotides in 
length, preferably less than 500 nucleotides in length. 

Typically, stringent conditions will be those in which the salt concentration 
is less than about 1.5 M Na ion, typically about 0,01 to 1. 0 M Na ion concentration 
1 0 (or other salts) at pH 7.0 to 83 and the temperature is at least about 30° C for short 
probes (e.g., 10 to 50 nucleotides) and at least about 60* C for long probes (e.g., 
greater than 50 nucleotides). Stringent conditions may also be achieved with the 

0 addition of destabilizing agents such as formamide. 

Exemplary low stringency conditions include hybridization with a buffer 
15 solution of 30 to 35% fbrmamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 

1 37*^ C, and a wash in IX to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M trisodium 
citrate) at 50 to 55** C. Exemplary moderate stringency conditions include 

Q hybridization in 40 to 45% fbrmamide, 1.0 M NaCl, 1% SDS at 37° C, and a wash 

p in 0.5X to IX SSC at 55 to 60° C. Exemplary high stringency conditions include 

J 20 hybridization in 5 0% formamide, 1 M NaCl, 1% SDS at 37^C, and a wash m 0. IX 

II SSC at 60 to 65° C. 

Specificity is typically the function of post-hybridization washes, the critical 
factors being the ionic strength and temperature of the final wash solution. For 
DNA-DNA hybrids, the can be approxunated from the equation of Meinkoth and 
25 Wahl Anal. Biochem. 138:267-284 (1984); 81.5° C + 16.6 (log M) +0.41 (%GC) 
- 0.61 (% form) - 500/L; where M is the molarity of monovalent cations, %GC is the 
percentage of guanosine and cytosine nucleotides in the DNA, % form is the 
percentage of formamide in the hybridization solution, and L is the length of the 
hybrid in base pairs. The T^^ is the temperature (imder defined ionic strength and 
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pH) at which 50% of a complementary target sequence hybridizes to a perfectly 
matched probe. 

Very stringent conditions are selected to be equal to the for a particular 
probe. An example of stringent hybridization conditions for hybridization of 
5 complementary nucleic acids which have more than 100 complementary residues on 
a filter in a Southem or Northern blot is 50% formamide with 1 mg of heparin at 42^ 
C, with the hybridization being carried out overnight. An example of highly 
stringent conditions is 0.1 5 M NaCl at IT C for about 15 minutes. An example of 
stringent wash conditions is a 0.2x SSC wash at 65** C for 15 minutes (see, 

10 Sambrook, infra, for a description of SSC buffer). Often, a high stringency wash is 
preceded by a low stringency wash to remove background probe signal. An 
example of medium stringency for a duplex of, e.g., more than 100 nucleotides, is 
Ix SSC at 45"* C for 15 minutes. An example low stringency wash for a duplex of, 
e.g., more than 100 nucleotides, is 4-6x SSC at 40'' C for 15 minutes. For short 

15 probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt 
concentrations of less than about l.OM Na ion, typically about 0.01 to 1.0 M Na ion 
concentration (or other slats) at pH 7.0 to 8.3, and the temperature is typically at 
least about 30** C. Stringent conditions can also be achieved with the addition of 
destabilizing agents such as formamide. In general, a signal to noise ratio of 2x (or 

20 higher) than that observed for an unrelated probe in the particular hybridization 
assay indicates detection of a specific hybridization. Nucleic acids that do not 
hybridize to each other under stringent conditions are still substantially identical if 
the proteins that they encode are substantially identical. This occurs, e.g., when a 
copy of a nucleic acid is created using the maximum codon degeneracy permitted by 

25 the genetic code. 

The following are examples of sets of hybridization/wash conditions that 
may be used to clone homologous nucleotide sequences that are substantially 
identical to reference nucleotide sequences of the present invention: a reference 
nucleotide sequence preferably hybridizes to the reference nucleotide sequence in 

30 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO^, 1 mM EDTA at 50°C with 
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washing in 2X SSC, 0.1% SDS at 50°C, more desirably in 7% sodium dodecyl 
sulfate (SDS), 0,5 M NaP04, 1 mM EDTA at 50^C with washing in IX SSC, 0.1% 
SDS at 50*^0, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M 
NaP04, 1 mM EDTA at SOT with washing in 0.5X SSC, 0.1% SDS at 50T, 
preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50T 
with washing in O.IX SSC, 0.1% SDS at SOT, more preferably in 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at SOT with washing in O.IX 
SSC, 0.1%SDSat65T. 

is reduced by about TC for each 1% of mismatching; thus, T^,,, 
hybridization, and/or wash conditions can be adjusted to hybridize to sequences of 
the desired identity. For example, if sequences with >90% identity are sought, the 
T^ can be decreased lO'^C. Generally, stringent conditions are selected to be about 
S'^C lower than the thermal melting point (T J for the specific sequence and its 
complement at a defmed ionic strength and pH. However, severely stringent 
conditions can utiUze a hybridization and/or wash at 1, 2, 3, or 4**C lower than the 
thermal melting point (T J; moderately stringent conditions can utilize a 
hybridization and/or wash at 6, 7, 8, 9, or lOT lower than the thermal melting point 
(T J; low stringency conditions can utihze a hybridization and/or wash at 1 1, 12, 13, 
14, 15, or 20''C lower than the thermal melting point (TJ. Using these parameters, 
hybridization and wash compositions, and desired T, those of ordinary skill will 
understand that variations in the stringency of hybridization and/or wash solutions 
are inherently described. If the desired degree of mismatching results in a T of less 
than 45T (aqueous solution) or 32**C (formamide solution), it is preferred to 
increase the SSC concentration so that a higher temperature can be used. An 
extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) 
Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with 
Nucleic Acid Probes, Part 1, Chapter 2 (Elsevier, New York); and Ausubel et al., 
eds. (1995) Current Protocols in Molecular Biology, Chapter 2 (Greene PubUshing 
and Wiley - Interscience, New York). See Sambrook et al. (1989) Molecular 
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Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, 
Plainview, New York). 

Generally, stringent conditions are selected to be about 5°C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and 
5 pH, However, stringent conditions encompass temperatures in the range of about 
I'^C to about lO^'C, depending upon the desired degree of stringency as otherwise 
qualified herein. Nucleic acids that do not hybridize to each other under stringent 
conditions are still substantially identical if the polypeptides they encode are 
substantially identical. This may occur, e.g., when a copy of a nucleic acid is 
10 created using the maximum codon degeneracy permitted by the genetic code. One 
indication that two nucleic acid sequences are substantially identical is when the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with 
Q the polypeptide encoded by the second nucleic acid. 

"Vector" is defined to include, inter alia, any plasmid, cosmid, or phage in 
2 1 5 double or single stranded linear or circular form which may or may not be self 

O transmissible or mobilizable, and which can transform prokaryotic or eukaryotic 

'J'^ host either by integration into the cellular genome or exist extrachromosomally (e.g. 

y autonomous replicating plasmid with an origin of replication), 

p Specifically included are shuttle vectors by which is meant a DNA vehicle 

L^x 20 capable, naturally or by design, of replication in two different host organisms, which 

II may be selected fi-om actinomycetes and related species, bacteria and eucaryotic 

(e.g. higher cell, mammahan, yeast or fimgal cells). 

Preferably the nucleic acid in the vector is under the control of, and operably 
linked to, an appropriate promoter or other regulatory elements for transcription in a 
25 host cell such as a microbial, e.g. bacterial, or animal cell. The vector may be a bi- 
fimctionai expression vector w^hich functions in multiple hosts. In the case of 
genomic DNA, this may contain its own promoter or other regulatory elements and 
in the case of cDNA this may be under the control of an appropriate promoter or 
other regulatory elements for expression in the host cell. 
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"Cloning vectors" typically contain one or a small number of restriction 
endonuclease recognition sites at which foreign DNA sequences can be inserted in a 
determinable fashion without loss of essential biological function of the vector, as 
well as a marker gene that is suitable for use in the identification and selection of 
cells transformed with the cloning vector. Marker genes typically include genes that 
provide tetracycline resistance, hygromycin resistance or ampicillin resistance. 

"Operably linked" means joined as part of the same nucleic acid molecule, 
suitably positioned and oriented for transcription to be initiated fi-om the promoter. 
DNA operably linked to a promoter is "under transcriptional uiitiation regulation" of 
the promoter. 

"Chimeric" is used to indicate that a DNA sequence, such as a vector or a 
gene, is comprised of more than one DNA sequences of distinct origin with are 
fused together by recombinant DNA techniques resulting in a DNA sequence, which 
does not occur naturally. 

The terms "heterologous DNA sequence," "exogenous DNA segment" or 
"heterologous nucleic acid," as used herein, each refer to a sequence that originates 
fi-om a source foreign to the particular host cell or, if from the same source, is 
modified from its original form. Thus, a heterologous gene in a host cell includes a 
gene that is endogenous to the particular host cell but has been modified through, for 
example, the use of DNA shuffling. The terms also include non-naturally occurring 
multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a 
DNA segment that is foreign or heterologous to the cell, or homologous to the cell 
but in a position within the host cell nucleic acid in which the element is not 
ordinarily found. Exogenous DNA segments are expressed to yield exogenous 
polypeptides. 

A "homologous" DNA sequence is a DNA sequence that is naturally 
associated with a host cell into which it is introduced. 

The following terms are used to describe the sequence relationships between 
two or more nucleic acids or polynucleotides: (a) "reference sequence", 
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(b) "comparison window", (c) "sequence identity", (d) "percentage of sequence 
identity", and (e) "substantial identity". 

As used herein, "reference sequence" is a defined sequence used as a basis 
for sequence comparison. A reference sequence may be a subset or the entirety of a 
5 specified sequence; for example, as a segment of a fiill length cDNA or gene 
sequence, or the complete cDNA or gene sequence. 

As used herein, "comparison window" makes reference to a contiguous md 
specified segment of a polynucleotide sequence, wherein the polynucleotide 
sequence in the comparison window may comprise additions or deletions (i.e., gaps) 

10 compared to the reference sequence (which does not comprise additions or 

deletions) for optimal alignment of the two sequences. Generally, the comparison 
window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 
50, 100, or longer. Those of skill in the Bit understand that to avoid a high similarity 
to a reference sequence due to inclusion of gaps in the polynucleotide sequence a 

1 5 gap penalty is typically introduced and is subtracted fi"om the number of matches. 

Methods of ahgnment of sequences for comparison are well known in the 
art. Thus, the determination of percent identity between any two sequences can be 
accomplished using a mathematical algorithm. Preferred, non-limiting examples of 
such mathematical algorithms are the algorithm of Myers and Miller, CABIOS 

20 4:11-17 (1988); the local homology algorithm of Smith et al. Adv. Appl. Math. 

2:482 (1981); the homology ahgnment algorithm of Needleman and Wunsch J. MoL 
Biol. 48:443-453 (1970); the search-for-similarity-method of Pearson and Lipman 
Proc. Natl. Acad. Sci. 85:2444-2448 (1988); the algorithm of Karlin and Altschul, 
Proc. Nath. Acad Sci. USA 872264 (1990), modified as in Karlin and Altschul, 

25 Proc. Nath. Acad. Sci. USA 90:5873-5877 (1993). 

Computer implementations of these mathematical algorithms can be utilized 
for comparison of sequences to determine sequence identity. Such implementations 
include, but are not limited to: CLUSTAL in the PC/Gene program (available firom 
Intelligenetics, Mountain View, California); the ALIGN program (Version 2.0) and 

30 GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
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Software Package, Version 8 (available from Genetics Computer Group (GCG), 575 
Science Drive, Madison, Wisconsin, USA). Alignments using these programs can 
be performed using the default parameters. The CLUSTAL program is well 
described by Higgins et al. Gene 73:237 244 (1988); Higgins et al. CABIOS 5:151- 
5 153 (1989); Corpet et al. Nucleic Acids Res. 16:10881-90 (1988); Huang et al. 
CABIOS 8:155-65 (1992); and Pearson et al. Meth. MoL Biol. 24:307-331 (1994), 
The ALIGN program is based on the algorithm of Myers and Miller, supra. The 
BLAST programs of Altschul et al, L Mol. Biol 215:403 (1990), are based on the 
algorithm of Karlin and Altschul supra. To obtain gapped aUgnments for 
10 comparison purposes, Gapped BLAST (in BLAST 2.0) can be utilized as described 
in Altschul et al. Nucleic Acids Res. 25:3389 (1997). Alternatively, PSI-BLAST (in 
BLAST 2.0) can be used to perform an iterated search that detects distant 
% relationships between molecules. See Altschul et al, supra. When utilizing 

9 BLAST, Gapped BLAST, PSI-BLAST, the default parameters of the respective 

5 1 5 programs (e.g. BLASTN for nucleotide sequences, BLASTX for proteins) can be 

J used. The BLASTN program (for nucleotide sequences) uses as defaults a 

hI wordlength (W) of 1 1 , an expectation (E) of 10, a cutoff of 100, M-5, N=-4, and a 

o comparison of both strands. For amino acid sequences, the BLASTP program uses 

W as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 

M= 20 scoring matrix (see Henikoff & Henikoff; Proc. Natl Acad. Sci. USA, 89,10915 

pj (1 989)). See http://www.ncbLnlm.iiih.gov . Alignment may also be performed 

manually by inspection. 

For purposes of the present invention, comparison of nucleotide sequences 
for determination of percent sequence identity disclosed herein is preferably made 
25 using the BlastN program (version 1 .4.7 or later) with its defauh parameters or any 
equivalent program. By "equivalent program" is intended any sequence comparison 
program that, for any two sequences in question, generates an ahgnment having 
identical nucleotide or amino acid residue matches and an identical percent sequence 
identity when compared to the corresponding alignment generated by the preferred 
30 program. 



As used herein, "sequence identity" or "identity" in the context of two 
nucleic acid or polypeptide sequences makes reference to the residues in the two 
sequences that are the same when aligned for maximum correspondence over a 
specified comparison window. When percentage of sequence identity is used in 
5 reference to proteins it is recognized that residue positions which are not identical 
often differ by conservative amino acid substitutions, where amino acid residues are 
substituted for other amino acid residues with similar chemical properties (e.g., 
charge or hydrophobicity) and therefore do not change the functional properties of 
the molecule. When sequences differ in conservative substitutions, the percent 
10 sequence identity may be adjusted upwards to correct for the conservative nature of 
the substitution. Sequences that differ by such conservative substitutions are said to 
have "sequence similarity" or "similarity." Means for making this adjustment are 
well known to those of skill in the art. Typically this involves scoring a 
conservative substitution as a partial rather than a full mismatch, thereby increasing 
15 the percentage sequence identity. Thus, for example, where an identical amino acid 
is given a score of 1 and a non-conservative substitution is given a score of zero, a 
conservative substitution is given a score between zero and 1, The scoring of 
conservative substitutions is calculated, e.g., as implemented in the program 
PC/GENE (Intelligenetics, Mountain View, California). 
20 As used herein, "percentage of sequence identity" means the value 

m determined by comparing two optimally aligned sequences over a comparison 

window, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (i.e., gaps) as compared to the 
reference sequence (which does not comprise additions or deletions) for optimal 
25 aUgnment of the two sequences. The percentage is calculated by determining the 
number of positions at which the identical nucleic acid base or amino acid residue 
occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the window of 
comparison, and multipl5dng the result by 100 to yield the percentage of sequence 
30 identity. 
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The term "substantial identity" of polynucleotide sequences means that a 
polynucleotide comprises a sequence that has at least 70%, 71%, 72%, 73%, 74%, 
75%, 76%, 77%, 78%, or 79%, preferably at least 80%, 81%, 82%, 83%, 84%, 85%, 
86%, 87%, 88%, or 89%, more preferably at least 90%, 91%, 92%, 93%, or 94%, 
and most preferably at least 95%, 96%, 97%, 98%, or 99% sequence identity, 
compared to a reference sequence using one of the alignment programs described 
using standard parameters. One of skill in the art will recognize that these values 
can be appropriately adjusted to determine corresponding identity of proteins 
encoded by two nucleotide sequences by taking into account codon degeneracy, 
amino acid similarity, reading frame positioning, and the like. Substantial identity 
of amino acid sequences for these purposes normally means sequence identity of at 
least 70%, more preferably at least 80%, 90%, and most preferably at least 95%. 

Another indication that nucleotide sequences are substantially identical is if 
two molecules hybridize to each other under stringent conditions. Generally, 
stringent conditions are selected to be about 5**C lower than the thermal melting 
point (T J for the specific sequence at a defined ionic strength and pH. However, 
stringent conditions encompass temperatures in the range of about 1**C to about 
20''C, depending upon the desired degree of stringency as otherwise qualified herein. 
Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides they encode are substantially identical. 
This may occur, e.g., when a copy of a nucleic acid is created using the maximum 
codon degeneracy permitted by the genetic code. One indication that two nucleic 
acid sequences are substantially identical is when the polypeptide encoded by the 
first nucleic acid is immunologically cross reactive with the polypeptide encoded by 
the second nucleic acid. 

The term "substantial identity" in the context of a peptide indicates that a 
peptide comprises a sequence with at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 
77%, 78%, or 79%, preferably 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 
or 89%, more preferably at least 90%, 91%, 92%, 93%, or 94%, or even more 
preferably, 95%, 96%, 97%, 98% or 99%, sequence identity to the reference 
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sequence over a specified compaiison window. Preferably, optimal alignment is 
conducted using the homology alignment algorithm of Needleman and Wunsch J. 
MoL BioL 48:443-453 (1970). An indication that two peptide sequences are 
substantially identical is that one peptide is immunologically reactive with 
5 antibodies raised against the second peptide. Thus, a peptide is substantially 

identical to a second peptide, for example, where the two peptides differ only by a 
conservative substitution. 

Detailed Description of the Invention 

10 This invention relates to peptide amino acid absorption in the dog, and more 

particularly, to separate, whole or partial-length, complementary DNA encoding 
putative canine low-affinity, high-capacity H7peptide transport proteins (cPepTl), 
mRNA transcripts corresponding to cPepTl, characterization of cPepTl by 
glycylsarcosine (GlySar) uptake, identification of dipeptides, tripeptides, and 

1 5 tetrapeptides well recognized by cPepTl , and the effect of supplemental peptide 
substrate on the transport capacity of cPepTl. 

The invention also provides a pet food composition comprising at least one 
dipeptide, tripeptide, or tetrapeptide that provides enhanced uptake of amino acids 
by PepTl. A typical canine diet for use in the present invention may also, for 

20 example, contain about 20 to about 30% crude protein, about 10 to about 20% fat, 
and about 10% total dietary fiber. However, no specific ratios or percentages of 
these or other nutrients are required. 

The inventors have discovered a method for identifying peptides (e.g. 
dipeptides, tripeptides, or tetrapeptides) that increase transport of amino acids by 

25 PepTl using MDCK cells, particularly when incubated with lactalbumin 

hydrolysate and assayed at optimum time post-seeding, as indicated in Example 2. 

In order that the invention may be more readily understood, reference is 
made to the following examples which are intended to illustrate the invention, but 
not limit the scope thereof 

30 
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EXAMPLE 1 
Generation of Partial-length Canine PepTl cDNA 

Partial cloning of canine PepTl (cPepTl) from small intestinal epithelium. 
Initial attempts (over 150) to partially clone the putative canine PepTl 
5 cDNA by reverse transcriptase-polymerase chain reaction (RT-PCR) methodology 
failed. The source of mRNA was canine liver tissue that had been frozen for about 6 
months (supplied by Dr. Randal Buddington, Mississippi State University) and 
oligomer primers were based on the rabbit PepTl sequence. Subsequently, frozen 
canine "mid" small intestine (jejunal) tissue segments became available (supplied by 

1 0 Dr. Buddington) and a partial length cDNA of about 780 base pairs (bp) was cloned 
by RT-PCR. Total RNA was isolated from jejunal epithelium scraped from 
intestinal sections using a standard acidic phenol-chloroform protocol. One \xg of 
mRNA was isolated from total RNA using POLY A TRACT SYSTEM® (Promega, 
Madison, WI) and reversed transcribed using murine leukemia virus reverse 

1 5 transcriptase (Perkin Elmer, Foster City, CA) and oligo(dT) primers (Gibco BRL, 
Grand Island, N.Y.). Successftil PCR reactions were 50 |iL and contained 1 faM 
MgCl2 and Taq polymerase (Perkin Elmer). Twenty-five thermal cycles of 94''C for 
1 min, 40''C for 45 sec, and ll^'C for 1 min were used. The cycles were preceded by 
a 55 sec denaturization of the RT product at 95''C, followed by a 10 min extension 

20 of RT-PCR products at 72T. More than 150 RT-PCR reactions testing ten different 
primer sets were required to achieve this protocol. The resulting cDNA using 
Primer Set 4 (Figure 1) was TA-cloned into the pCR®II vector (Invitrogen, Carlsbad, 
CA), plasmid-containing colonies selected by blue/white screening, and ampHfied 
following instructions of the manufacturer. Restriction analyses of recovered 

25 pCR®II/cDNA plasmids revealed that four of fifty-six clones contained cDNA 
consistent with rabbit PepTl cDNA (Figure 2). 

Northern blot analysis of cPepTl expression in dog tissue and MDCK cells. 
The potential expression of cPepTl mRNA by canine kidney, small 
30 intestinal epithehum, and immortalized kidney distal tubule epithelial cells (Madin- 
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Darby Canine Kidney, MDCK) was evaluated by Northern analyses using cDNA 
derived from canine jejunal epithelium (Figure 3). RNA were subjected to 1% gel 
electrophoresis in the presence of 0.02 M formaldehyde, transferred by downward 
capillary action to 0.45 -jum nylon membranes (Hybond-N, Amersham, Arlington 
Heights, IL), and covalently cross-linked by ultra-violet light. cDNA were 
randomly labeled with [^^P]-CTP using a kit (Gibco BRL), purified through 
Sephadex-50 columns (Amersham Pharmacia, Piscataway, NJ.), and hybridized 
with blots at 56°C for 18 h. The blots were then washed 2 times at 56°C for 15 min 
and once at 57°C for 10 min. Autoradiographs were exposed to blots at 80°C for 24 
h and the size of the transcript determined by regression of hybridized bands against 
the migration distance of 18S (1.9 kb) and 28S (4.9 kb) RNA. 

Each canine tissue-derived cDNA (TA-clone 26, Figure 3A; TA-clone 6, 
Figure 3B) hybridized to three mRNA species in dog kidney, dog small intestinal 
epithelium, and MDCK cells. To confirm identification of PepTl mRNA by these 
canine cDNAs, RNA isolated from dog kidney and liver tissues were probed for 
expression of PepTl mRNA using a fiiU-length rabbit PepTl cDNA (Figure 4; 
rabbit PepTl cDNA supphed by Drs. F. Leibach and V. Ganapathy, Medical 
College of Georgia). The results also demonstrated the expression of the same three 
PepTl mRNA species by dog tissues, indicating that the fiiU-length rabbit PepTl 
cDNA and the cDNA derived from canine tissue in the present study identified the 
same transcripts. The mean/SD of transcript sizes calculated from these three blots 
were 4.2/.22, 2.75/.26, and 1.46/.42 kb, respectively. Collectively, these data 
indicate that liver, intestinal epitheKal, and MDCK cells express the same size and 
number of PepTl transcripts. In comparison, various tissues of chicken, sheep, cow, 
pig, rabbit, rat, human, and Caco2 cells are reported to express a single transcript, 
with the principle difference in size being between chicken (1.9) and mammalian 
species (2.8, 2.8, 2.9, 2.9, 3.0, 3.1, 2.9, respectively 
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Partial cloning and sequence identification of canine PepTl (cPepTl) cDNA from 
MDCK cells. 

To confimi the positive Northern analysis, identification of PepTl mRNA 
expression using cDNA generated from dog small intestinal epithelium, RT-PCR 
methodologies were used to generate a PepTl cDNA from MDCK cells. The target 
cDNA region was a subset of the cDNA generated by RT-PCR from canine small 
intestine (bp 83 to 887 of rabbit PepTl). Accordingly, PCR primers that 
corresponded to bp 259 to 619 of rabbit PepTl (GenBank acc. no. U06467) were 
used to generate a partial-length "canine PepTl" (cPepTl) cDNA from mRNA 
isolated from MDCK cells. RNA was collected from cells that were plated at 
30,000 cm^ on rat tail collagen-coated dishes and cultured for 3 days in 10% fetal 
calf serum/DMEM. Reverse transcription of 5 |Lig of total RNA by 
SUPERSCRIPT® II reverse transcriptase (Gibco-BRL) was performed using 
random and oligo(dT) primers, per instructions of the manufacturer (Gibco-BRL). 
All PCR reactions contained 2 mM MgCl2 and thermal cycling using Taq 
polymerase included 30 cycles at 94''C for 2 min, 55^C for 1 min, and 72°C for 2 
min. The cycles were preceded by a 10 min denaturization of the RT product at 
94T, followed by a 10 min extension of RT-PCR products at 72^C. More than one 
hundred RT-PCR reactions were required to achieve this protocol. 

The resulting cDNA of about 380 bp was TA-cloned, into the site of pCR®II 
vector (as described above), amplified, bacterial colonies evaluated by blue/white 
screening, and pCR®II/cDNA plasmids evaluated for cDNA by Eco Rl/Pst I 
restriction analysis (as described above). Restriction analyses of recovered plasmids 
revealed that six of thirty-six clones contained cDNA consistent with rabbit PepTl 
cDNA . Two of the confirmed plasmids were amplified in bacteria, recovered, and 
sent for sequencing by the University of Florida DNA Sequencing Core Facility 
(Gainesville). Sequence comparisons of this 380 bp cDNA (Figure 5) to PepTl 
sequences of other species using BLAST 2.0.14. software (blast@ncbi.nlm.nih.gov) 
revealed that the canine sequence shares sequence homology of 79% to rabbit (bp 
259 to 640; GenBank acc. no. 473375), 83% to rat (bp 213 to 593; GenBank acc. no. 

39 



i 



ill 



D50664.1), 83% to mouse (bp 213 to 589; GenBank acc. no. AF205540), and 87% 
to human (bp 285 to 665; GenBank acc. no. 473375 and U13173) PepTl sequences. 

Demonstration of PepTl -like transport activity in MDCK cells. 

5 As seen in Figures 3 and 5, MDCK cells express a canine homolog of 

mammalian PepTl mRNA. Potential expression of PepTl transport activity (H""- 
dependent, dipeptide inhibitable, low-affmity dipeptide transport) by confluent 
MDCK cells was evaluated using whole-cell transport techniques and 
glycylsarcosine (GlySar) as a model dipeptide substrate. Cells were seeded at 

10 60,000 cells/cm^ into 24-well trays that had been coated with rat tail collagen or 
poly-L-lysine and cultured (95%02:5% CO2 at 37°C) for 3 d in media consisting of 
Dulbecco's Modified Eagle Medium/10% fetal calf serum/1% antimicrobial 
antibacterial medium. Absorption (pmols/mg protein) of [^H]-glycyl-L-sarcosine 
(GlySar; 6 mCi/mL, Moravek Biochemicals, Brea, CA) was determined using the 

1 5 24-well cluster tray method and representative scintillation counting. Before 

transport, cells were incubated at 37°C for 30 nain in 25 mM Hepes/Tris (pH 7.5), 
140 mM NaCl, 5.4 mM KCl, 1.8 mM CaCl^, 0.8 mM MgS04, and 5 mM glucose 
(uptake buffer) to normalize intracellular amino acid and peptide pools. Transport 
was initiated by the addition of 0.25 mL of uptake buffer that contained 2.88 [xm 

20 GlySar. After 30 min of uptake at 37T, transport was terminated by rapid washing 
of cells with 4 X 2 mL 4°C uptake buffer. Cellular protein was precipitated with 
10% trichloroacetic acid and the supernatant recovered and counted to determine 
radioactivity (^H) content. Cellular protein was then solubiUzed in 0.2 N NaOH and 
0.2% SDS and quantified by the Lowry procedure, using bovine serum albumin as a 

25 stmdard. The amount of H''-dependent GlySar absorbed was calculated as the 
difference between uptake in pH 6.0 and pH 7.5 uptake buffers. The amount of 
competitor substrate-inhibitable GlySar uptake was calculated as the quotient of 
GlySar uptake in the absence and presence of 10 mM competitor substrate 
(dipeptide or amino acid) multipUed by 100%. 
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GlySar uptake in the presence of an intracellularly H"" gradient (extracellular 
pH of 6.0) was 2.3-fold higher in cells plated on collagen, and 1. 7-fold higher when 
grown on poly-L-lysine, than uptake in pH 7.5 medium (Table 1). H'^-dependent 
uptake of GlySar by MDCK cells was inhibited by 88 or 92% by the presence of 10 
5 mM LeuTrp or TrpLeu when grown on collagen, and 87 or 92% when grown on 
poly-L-lysine, respectively (Table 1). 

Table 1. Influence of extracellular pH and competitor substrates on uptake of [^H]- 
glycylsarcosine by MDCK cells cultured on collagen- or poly-L-lysine-coated trays. 
10 Cells were cultured as described in text and uptake compared in pH 7.5 or 6.0 media 
that contained 2.88 [^H]-glycylsarcosine for 30 min. 



n 


n 


Extracellular 


Competitor 


Glysylsarcosine 
uptake pmol 


% 

inhibition 


^1 


pH 


substrate(mM) 


30 min"' mg"' 


of pH 6.0 










protein 


uptake 




Collagen-coated 








hid 


5 


7.5 


none 


19.9 ±2.80 


na' 


fu 


5 


6.0 


none 


65.3 ± 7.95 


100 




5 


6.0 


LeuTrp (10) 


7.68 ±1.37 


11.7 




5 


6.0 


TrpLeu (10) 


5.21 ± 0.39 


8.0 




4 


6.0 


Leucine (10) 


63.0 ±4.00 


96.3 




Poly-L-lysine-coated 










4 


7.5 


none 


15.52 ± 1.06 


na 


ly 


5 


6.0 


none 


42.31 ±4.03 


100 




5 


6.0 


LeuTrp (10) 


5.50 ±0.58 


13.0 




5 


6.0 


TrpLeu (10) 


3.44 ± 0.27 


8.1 




4 


6.0 


Leucine (10) 


41.93 ±2.70 


100 



^ na, not appUcable 



15 To preliminarily characterize the kinetic parameters of peptide transport by 

MDCK cells, the uptake of GlySar in media that contained pH 6.0 and 0.00064, 
0.0025, 0.010, 0.04, 0.160, 0.640, 2.56, or 10,2 mM of GlySar was measured 
(Figure 6). Total GlySar uptake was by a relatively low-affinity mechanism 
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(apparent of about 4.0 mM) and high uptake velocity. Collectively, these 
characteristics of GlySar uptake are consistent with functional activity of PepTl 
expressed by other species, as opposed to high-affinity, HT-dependent uptake by 
PepT2 (^m K^. Accordingly, it is concluded that MDCK cells possess PepTl -like 
5 activity, consistent with detection of PepTl mRNA by RT-PCR (Figures 1, 2, 5) and 
Northern blot analyses (Figures 3,4). 

Summary of Example 1 

Separate partial-length canine PepTl cDNAs (cPepTl) were generated by 
10 RT-PCR analyses from dog small intestinal epithelium (n = 2; Figures 1, 2) and 
immortahzed canine kidney cells (MDCK cells, n 1). The MDCK cDNA was 
sequenced (Figure 5) and found to share 79 to 87% sequence identity with PepTl 
mRNA expressed by other mammalian species. Northem blot analyses using the 
3 intestinal epithelium-derived RT-PCR cDNA confirmed expression of canine PepTl 

15 (cPepTl) by dog tissues (liver, n = 3; kidney, n = 3; small intestine n = 1) and 
5 MDCK cells (n = 2). The identification of mRNA transcripts corresponding to 

II PepTl using partial-length canine-derived PepTl cDNA (Figure 3) was confirmed 

□ by hybridization to full-length rabbit cDNA (Figure 4). Characterization of GlySar 

uptake by MDCK cells demonstrated that MDCK cells express PepTl -like activity 
^ 20 (Table 1, Figure 5), confirming detection of PepTl mRNA expression by MDCK 

cells and use of MDCK cells as a model to ch^acterize the fimction of canine 
PepTl. 

EXAMPLE 2 

25 Experimental Model of MDCK Cells for Evaluating the Effects of Various 
Peptide and Drug Substrates, and Hormones And/or Growth Factors, on the 

Expression of PepTl Activity 

Example 1 above showed that (1) a canine homolog of PepTl (cPepTl) 
mRNA cloned from epitheUa of the mid small intestine (jejunum) shares high 
30 sequence identity with PepTl expressed by several other species, (2) canine liver, 
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kidney, and jejunal epithelium express a similar pattern of cPepTl mRNA, and 
(3) MDCK cells are capable of H'^-dependent peptide uptake. Accordingly, MDCK 
cells are an appropriate model to evaluate the biochemical characteristics of cPepTl. 
The specific goals of this research were to (1) characterize the functional activity of 
5 low-affinity H^-dependent GlySar uptake (PepTl activity) by MDCK cells and 
(2) identify di- and tripeptides that are well recognized by cPepTl (cPepTl), 
especially those that contain tryptophan and leucine. 

Previous research (Brandsch et al, 1994, Biochem J. 299:253-260) briefly 
reported that H^-dependent peptide uptake by MDCK cells was greater when cells 

10 were grown in a medium that contained lactalbumin hydrolysate (LHM) versus one 
that contained free amino acids (DMEM). Therefore, in an attempt to establish the 
most sensitive model possible for evaluating peptide transport systems in MDCK 
cells, the potential influences of LHM (peptide-containing) versus DMEM (peptide- 
lacking) media, and subconfluent versus confluent initial cell plating densities were 

15 compared. MDCK cells were seeded at either 60,000 cells/well (subconfluent) or 
120,000 cells/well (confluent) in DMEM and, after 1 d, cultured in DMEM or LHM 
media for 1, 2, 3, or 5 d. The amount of protein (index of cell growth) and GlySar 
uptake (index of peptide uptake capacity) expressed by each well of cells was then 
determined. As seen in Figure 7, the amount of cellular protein increased (P < .05) 

20 for both seeding densities and media with time of culture. A time x media 

interaction was observed, which reflects the greater protein content of cells grown in 
DMEM at day 6, as compared to those grown in LHM. At days 2, 3, or 4, however, 
no difference in protein content was observed. 

The uptake of ['H]-GlySar (2.88 |llM, 5 |iCi/mL) by the MDCK cells 

25 described in Figure 7 was measured in the presence (pH 6.0 uptake buffer) and 
absence (pH 7.4 uptake buffer) of an extracellular-to-intercellular H^ (proton) 
gradient. A representative graph (Figure 8) compares the uptake of GlySar by cells 
seeded at 60,000/well and cultured in the LHM or DMEM. For both culture media, 
GlySar uptake in the presence of pH 6.0 was greater (P < .01) than that in pH 7.4 

30 buffer md displayed a quadratic (P < .01) response to length of culture, reflecting a 
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buffer X day of culture interaction (P < .01). DMEM-cultured cells seeded at 
120,000/well displayed almost identical uptake characteristics as just described for 
cells seeded at 60,000/welL In contrast, GlySar uptake in the presence of pH 6.0 
buffer at day 3 by LHM-cultured cells was only 28% larger (quantitatively) than that 
5 observed by DMEM-cultured cells seeded at 60,000/well. 

To further refine the analysis of media influence on the peptide transport 
capacity of MDCK cells plated at 60,000 or 120,000 cells per well, the H^- 
dependent GlySar uptake was calculated as the arithmetic difference between uptake 
in pH 6.0 and pH 7.4 buffers (Figure 9). Despite the comparable protein contents of 
10 cells observed at day 3 (Figure 7), cells seeded at 60,000 and grown in LHM media 
demonstrated about 60% greater capacity for GlySar uptake as did cells grown in 
DMEM (Figure 9; day x media interaction, P < .01). For all cells, the capacity for 
^ GlySar uptake per mg of cellular protein was decreased at day 6. This difference 

O was the result of a lesser uptake at pH 6.0 by the LHM-cultured cells, and not the 

y 15 result of a larger pH 7.4 uptake. 

+ The results of this experiment indicate that culturing cells in media that 

fU contains peptides does not increase growth rate but does increase the capacity for 

p peptide uptake if cells are seeded at 60,000/well and cultured for 2 days in LHM. 

As such, these data are consistent with the induction of PepTl expression by culture 
H 20 peptide-containing medhmi and describe ati optimal set of culture conditions for 

K characterizing ff-dependent peptide transport activity of the canine PepT 1 

transporter. These data also confirm, and more thoroughly describe, the stimulating 
effect of LHM versus DMEM media on peptide transport proteins that was initially 
reported by Brandsch et al. (1994). 
25 Using the maximal uptake-stimulating culture parameters determined in 

Experiment 3, the effect of an extracellular-to-intracellular pH gradient on GlySar 
uptake was further evaluated to determine a pH level at which maximal GlySar 
uptake could be achieved, but which would repUcate physiologic conditions 
(Figure 10). As expected, the presence of a pH gradient stimulated (P < .001) H^- 
30 dependent GlySar uptake, in a quadratic (P < .01) fashion. Uptake at pH 5.5 or 6.0 
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was about 2.7 times greater than that achieved at pH 7.5, These results are 
consistent with the data in Figures 8 and 9 and known H'^-dependence of 
mammalian peptide transport proteins. Accordingly, the use pH 6.0 buffers for the 
characterization of H^-dependent GlySar uptake was incorporated into the standard 
5 experimental conditions. 

To determine the appropriate time period to measure initial (linear) rates of 
GlySar uptake, a by-minute time-course experiment was performed. As seen in 
Figure 11, iT-dependent GlySar (100 uM) uptake increased linearly for 1 h and then 
slowed (quadratic response, P < .01). GlySar uptake in pH 6.0 buffer at 3.75, 7.5, 

10 15, 30, 60 and 120 min was about 2, 2.1, 2.25, 2.65, 2.79, and 2,62 times more (P 
< .001), respectively, than uptake from pH 7.4 buffer. Because uptake was 
proportional to time of uptake through 1 h, future experiments were conducted using 
a 30-min time period. 

To confirm that H'^-dependent GlySar uptake was saturable, and therefore 

1 5 mediated, the uptake of GlySar from pH 6.0 and 7.4 uptake buffers containing 

0.025, 0.1, 0.4, 1.6, 6.4, or 25.6 mM GlySar was evaluated (Figure 12). Uptake of 
GlySar was greatest (P < .001) from the pH 6.0 buffers, at all concentrations. H""- 
dependent GlySar uptake was saturable, consistent with an apparent IC„, for GlySar 
of about 1.1 mM. These values are consistent with our preliminary trials that 

20 estimated a of 1 . 1 mM for GlySar uptake by MDCK cells using only pH 6.0 

uptake buffer and indicate that H^-dependent GlySar uptake is predominately, if not 
completely, a result of low affinity (mM) HVpeptide cotransporter activity (PepTl). 
As a comparative value, the reported of for GlySar uptake by the PepTl- 
expressing Caco-2 cells also is 1 . 1 mM. It is of interest also to note that GlySar 

25 uptake in the absence of a pH gradient (pH 7.4 buffers) also displayed linear (P < 
.01) and quadratic (P < .001) components, (1) reflects that the pH "7.4" buffer was 
in fact sUghtly acidic, (2) represents the activity of the putative basalateral peptide 
transporter running in "reverse", or (3) indicates the presence of a non-characterized 
peptide transport system. As a result of this experiment, subsequent H^-dependent 



45 



peptide transport trials were conducted using 100 |liM GlySar, a value well below 
the but one that will result in increased transport activity, and thus, sensitivity. 

Characteristic hallmarks of low affinity H7peptide cotransport activity, 
classically defined using membrane vesicles of several species, and more recently 
by functional expression studies using human, rat, and rabbit PepTl cDNA, is the 
recognition of some, but not all, (3-lactam antibiotics. In addition, PepTl 
recognition of cefadroxil is low (the IC, of cefadroxil inhibition of GlySar uptake by 
PepTl is 3 mM), whereas recognition of cefadroxil by PepT2 is high (the Ki of 
cefadroxil inhibition of PepT2 transport of GlySar is 30 |aM). To determine whether 
MDCK cPepTl activity shared these functional features, the uptake of 100 |aM 
GlySar in the absence and presence of pH 7.5 and pH 6.0 buffer, and, in pH 6.0 
buffers, the presence of 1 mM additional GlySar (self-inhibitor control), 3 mM 
Penicillin-G, 30 |iM cefadroxil, or 3 mM cefadroxil was compared (Figure 13). IT- 
dependent GlySar uptake was not inhibited by penicillin-G or 30 |liM cefadroxil, but 
was inhibited about 76% by 3 mM cefadroxil. As expected, the presence of 1 mM 
GlySar self-inhibited 100 |liM GlySar uptake by 64%. These results indicate that 
H+-dependent uptake of GlySar by MDCK cells is by PepTl activity. 

Other hallmarks of PepTl function are the decreased ability of Gly- 
containing peptides to inhibit GlySar, in proportion to their length, and sensitivity to 
inhibition by camosine (P-Ala-His). To determine if cPepTl activity behaves as 
reported for other PepTl activities, the relative abiUties of 1 mM Gly ([^H]-Gly free 
amino acid control), GlyGly, [Gly]4, or [Gly]^ to inhibit H^-dependent 100 jaM 
GlySar was determined (Figure 14). Gly (5.0%) and [GlyJs (7.3%) did not influence 
uptake, whereas GlyGly inhibited and [Gly]4 tended to inhibit uptake by 63 and 
23%, respectively. This pattern of Gly-containing peptides to inhibit GlySar uptake 
in an inverse proportion to the number of glycyl residues in the canine MDCK cell 
model is consistent with PepTl activities reported for other species. Similarly, 
GlySar uptake was inhibited 50%o by 1 mM camosine (data not shown but listed in 
Table 2 below). 
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Together with the molecular identification of PepTl mRNA expression in 
MDCK cells using full-length rabbit cDNA and our canine RT-PCR product (See 
Example 1 data), the above biochemical characterization data indicate that H^- 
dependent GlySar uptake activity in MDCK cells is consistent with the low-affmity, 
5 high-capacity of the PepTl transport protein. Collectively, the above experiments 
resulted in the generation of an experimental regimen for the culture and 
determination of H''-dependent peptide transport activity in MDCK cells, with 
which to evaluate the relative substrate preferences of canine PepTl (cPepTl). 

Accordingly, the following general regimen was used to perform a series of 
10 experiments that evaluated the relative abilities of candidate di- (primarily) and tri- 
peptides to inhibit GlySar uptake by endogenously expressed cPepTl in MDCK 
cells: 

1 . Sixty thousand cells/well were plated into collagen-coated 24-well 
trays and cultured at Sl^'C in an atmosphere of 95% air/5% COj in 

1 5 DMEM/1 0% PCS that contained antibiotics for 1 day. 

2. The media was removed and cells were cultured in LHM/10% 
FCS/antibiotics for 1 day. 

3. The media was removed and cells cultured in LHM/10% PCS (no 
antibiotics) for 20 h. 

20 4. The media was removed and cells cultured for 30 min in air at 37''C 

in depletion medium (25 mM Hepes/Tris (pH 7.5), 140 mM NaCl, 5.4 mM 
KCl, 1.8 mM CaC12, 0.8 mM MgS04, and 5 mM glucose, to normalize 
intracellular nutrient pools. 

5. Transport was initiated by replacing depletion medium with uptake 
25 medium (depletion medium adjusted to pH 6.0 or kept at pH 7.4) that 

contained 100 |j,M GlySar (at a specific activity of 5 jaCi/mL, with [^H]- 
GlySar supplying 2.88% of total GlySar substrate) and (or) 1 mM of 
inhibiting peptide. 

An inhibitory substrate concentration of 1 mM was selected because the 
30 literature indicates that typical K„, values for PepTl ranges from 0,5 to 5 mM. 
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Therefore, by selecting an inhibitor concentration of 1 mM (not expected to 
completely inhibit uptake), our goal was to more finely delineate the relative 
abilities of candidate inhibitors than if the typical 5 mM inhibitor concentration 
(expected to achieve close to 100% inhibition of GlySar uptake) was used. 
Candidate peptides were selected based on their containing Trp, Leu, Met, and (or) 
Arg, substrates. In total, 23 inhibitory peptides and 2 drug compounds were 
screened using this protocol. 

To determine the potential of Trp and Leu absorption as dipeptides by 
cPepTl, the ability of TrpLeu versus LeuTrp dipeptides to inhibit 100 fxM GlySar 
uptake was evaluated (Figure 15), The presence of either TrpLeu or LeuTrp in the 
pH 6.0 uptake buffer abolished H^-dependent GlySar uptake by 1 17% or 1 14%), 
respectively. In contrast, neither Leu nor Trp significantly influenced H''-dependent 
GlySar uptake. These results indicate that a lesser concentration of inhibitor would 
be required to delineate the relative recognition of TrpLeu and LeuTrp by cPepTl. 
With regard to the mechanism of BT-mdependent GlySar uptake observed 
throughout these experiments, it is of interest to note that TrpLeu and LeuTrp 
inhibited H^-zwdependent GlySar uptake by 36% and 46%, respectively. 

To further evaluate the potential of Trp to be absorbed in the form of 
peptides by cPepTl, the ability of TrpTrp, TrpGly, and TrpGlyGly to inhibit GlySar 
uptake was compared (Figure 16). As observed for TrpLeu (Figure 15), TrpTrp 
aboHshed H^-dependent GlySar uptake and inhibited iT-zndependent uptake by 
about 22%. TrpGly abolished H^-dependent GlySar uptake but did not influence 
ff-mdependent uptake. The tripeptide TrpGlyGly also significantly inhibited 
GlySar uptake, but to a lesser extent (73%) than did TrpTrp or TrpGly. 

To determine the relative potential of other amino acids (Met, Arg, Lys, Phe, 
for example) to be absorbed in the peptide-bound form, additional GlySar 
competitive inhibition experiments were conducted using the above-described 
regimen and a variety candidate peptides at 1 mM. The results of these experiments 
are summarized in Table 2, which also includes those experiments described in 
Figures 13, 14, 15, and 16 for comparative purposes. 
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Table 2, Influence of ImM extracellular peptides and antibiotics on 100 jaM 
glycylsarcosine (GlySar) uptake^ by MDCK cells. 
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"^That portion of the % inhibition value that exceeds 100, likely represents the 
ability to inhibit H'^-mdependent GlySar uptake. 
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The inhibitors are listed within groupings in order of their relative ability to 
inhibit 100 |j,M of GlySar uptake. In addition to the listed peptides, the constituent 
free amino acids were tested within the appropriate experiment to evaluate whether 
the peptide-bound or free amino acid was responsible for any affect on GlySar 
5 uptake. As expected, the presence of 1 mM constituent free amino acid did not 
influence GlySar uptake. Inhibition percentages of 50% indicate that the inhibitor 
substrate was recognized at least as well as was GlySar, given that the of GlySar 
was determined to be about 1 mM (Figure 12) and that the substrate was present at 1 
mM. Of the 19 treatment peptides evaluated, eleven abohshed H''-dependent GlySar 

10 uptake, with seven of these also displaying the ability to inhibit H''4ndependent 
GlySar uptake. Of the remaining eight peptides tested, four displayed greater than 
80% inhibition while four inhibited GlySar uptake by 50% or less. These results 
indicate that a wide variety of peptides of nutritionally important constituent amino 
acids are recognized by cPepTl. 

15 Overall, the observation that cPepTl activity was sensitive to a number of 

substrates is typical of PepTl ftinction. However, what was surprising was the large 
number of peptides that completely inhibited GlySar uptake. To estabhsh a more 
sensitive relative inhibitory order among peptides that inhibited GlySar uptake by 
more thaa 80%o, and, therefore, a more accurate potential for recognition, fourteen 

20 peptides were re-screened for their ability to inhibit 100 jiM GlySar uptake using the 
same cell culture aad tr^sport regimen but using only 10% of the previous inhibitor 
concentration (100 |aM). The data from an experiment to directly compare the 
ability of 100 jliM Trp-containing peptides are shown in Figure 17. All Trp- 
containing peptides inhibited ff-dependent GlySar uptake. However, TrpLeu 

25 inhibited more (92%) than did LeuTrp (58%), TrpTrp (62%), or TrpGly (45%). 
These values and the results of other experiments comparing the relative ability of 
Leu-, Met-, and Arg-containing peptides are listed in Table 3. 
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Table 3. Influence of 100 ^iM extracellular peptides on 100 |j,M glycylsarcosine 
(GlySar) uptake' by MDCK cells. 
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Overall, four of the peptides inhibited GlySar uptake by at least 80%, six by 
more thaa 40%, and four less than 40%, thus establishing a relative ranking for 

35 recognition by cPepTL Among the five Trp-containing peptides (Figure 17, Table 
3), TrpLeu demonstrated the greatest ability to inhibit GlySar uptake. TrpLeu also 
demonstrated the greatest ability to inhibit GlySar uptake (94%) among the Leu- 
containing peptides. Among the Met-containing substrates that were directly 
compared within the same experiment, the neutral peptides, MetMet and MetPhe, 

40 inhibited more GlySar uptake than did the anionic (MetGlu) or cationic (MetLys) 
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carboxyl residues. Interestingly, as a group the Arg peptides demonstrated the least 
inhibitory ability, seemingly in keeping with the apparent lesser recognition by 
PepTl of substrates with charged residues. However, it is of interest to note that 
100 jiM ArgLeu demonstrated a much greater ability to inhibit GlySar uptake than 
5 did LeuArg (49 versus 8.9%). 

To confirm the relative ranking of TrpLeu>TrpTrp inhibition of GlySar 
(Tables 2 ^d 3), MichaeUs-Menton constants for substrate inhibition (K^) of GlySar 
uptake by TrpLeu and TrpTrp were generated by graphical analyses of IC50 
experiments (Figure 18). In keeping with the results achieved in the 100 |aM- 
1 0 inhibition studies, TrpLeu inhibited GlySar uptake at lower concentrations than did 
TrpTrp (K, = 0.2 versus 0.75 [xM, respectively). 

Collectively, the results of cPepTl competitive inhibition trials using MDCK 
cells indicate that TrpLeu is better recognized by cPepTl than any other tested 
peptide. The results also indicate that a number of Trp-, Leu, and Met-containing 
1 5 peptides also are well recognized by cPepTl . Ultimately, in the intestinal 
g environment, it is the combination of recognition by the transporter and relative 

resistance of the peptide to luminal and membrane-bound peptidases that will 
determine how much of a given peptide will be absorbed. In this regard, there is 
some evidence to suggest that Gly-X peptides are more resistant than other peptides, 
^ 20 especially by blood and renal peptidases. If so, then GlyLeu may be a better 

m candidate substrate than TrpLeu to supply Leu. Similarly, tripeptides, as a group, 

are thought to be relatively resistant to hydrolysis. Thus, more TrpGlyGly may 
prove to be absorbed in larger amounts by the intestine than TrpLeu. 

An important result of this set of experiments was the establishment of a 
25 sensitive experimental regimen/model to evaluate potential affecters of peptide 

transport capacity. Accordingly, this experimental model of MDCK cells grown in 
LHM affords an opportunity to evaluate the effects of various peptide and drug 
substrates, and hormones and (or) growth factors, on the expression of PepTl . 
Thus, the culture of MDCK cells in LHM versus DMEM results in an 
30 increase of ff-dependent GlySar uptake (K„, = 1 . 1 mM) that is consistent with 
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mammalian PepTl-like activity. Using this stimulated model, the ability of twenty- 
three di- and tripeptides at 1 mM, and fourteen at 100 p,M, extracellular 
concentrations were screened for their abihty to inhibit 100 iiiM GlySar uptake, as 
an indicator of recognition by PepTl . Of the Trp- and (or) Leu-containing peptides 

5 evaluated, TrpLeu (K^ = 02 |lxM) and LeuTrp (K, = 0.75 |iM) demonstrated the 
greatest abiUty to inhibit GlySar uptake, with TrpLeu demonstrating a relatively 
higher affinity (lower K,) for PepTl. Of the Met-containing peptides evaluated, four 
(MetMet, MetPhe, LeuMet, MetLeu) appear particularly well recognized by PepTl . 
In contrast, as a group, Arg-containing peptides displayed the least inhibition of 

10 PepTl activity. Overall, these results indicate that cPepTl is capable of recognizing 
a variety of di- and tripeptides, including, for example, those that contain leucine 
and tryptophan. 

EXAMPLE 3 

1 5 Experimental Model to Determine Whether the iT/peptide Transport Capacity 
Expressed by MDCK Cells Is Sensitive to Substrate Regulation 

Triall: 

Examples 1 and 2 above demonstrated that Madin-Darby canine kidney 
(MDCK) cells express PepTl mRNA and characterized H'^-dependent biochemical 

20 properties. Therefore, MDCK cells were chosen as the experimental model to 
determine whether the HVpeptide transport capacity expressed by MDCK cells is 
sensitive to substrate regulation. Research from Example 2 demonstrated that 
MDCK cells grown in lactalbumin hydrolysate medium (LHM) had elevated levels 
of peptide uptake capacity. Accordingly, to avoid potential confounding effects of 

25 the peptide-containing LHM and individual treatment peptides, DMEM (contains no 
peptides) and not LHM was selected as the appropriate medium to test the influence 
of extracellular peptides on canine PepTl functional capacity of MDCK cells. 
GlyPhe was selected as a substrate because it has been reported to increase brush 
border membrane content of PepTl, (Shiraga T, Miyamoto K, Tanaka H, 

30 Yamamoto H, Taketani Y, Morita K, Tamai I, Tsuji A, Takada E, Cellular and 
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molecular mechanisms of dietary regulation on rat intestinal H+/peptide transporter 
PepTl. Gastroenterology 1999; 1 16:354-362), whereas Phe and Gly were tested as 
constituent free amino acid treatment controls. Camosine was selected because of 
its high content in meat-based diets. 

Cell culture. All cells were plated (60,000/2cmVell) and cultured (95% 
air/5% CO2, 37°C) for 24 h in Dulbecco's Modified Eagle Media/10% fetal calf 
serum (FCS)/l%Antibiotic/ Antimicrobial solution (ABAM) (DMEM media). 
Following these initial common culture conditions, cells then were cultured in 
DMEM, or DMEM that contained 10 mM of Camosine, GlyPhe, Phe, or Gly. 
Media were changed every 24 h. Media treatments (n = 8) were as follows: 

DMEM 

DMEM + 10 mM Camosine 
DMEM + 10 mM GlyPhe 
DMEM + 10 mM Phe 
DMEM+ lOmMGl 

Uptake measurements. The measurement of f H]Glysarcosine uptake was 
performed by using a 24-well cluster tray method (Kilberg MS. Measurement of 
amino acid transport by hepatocytes in suspension and monolayer culture. Methods 
Enzym 1989; 173:564-575. Matthews JC, Aslanian A, McDonald KK, Yang W, 
Malandro MS, Novak DA, Kilberg MS. An expression system for mammahan 
amino acid transport using a stably maintained episomal vector. Anal Biochem 
1997; 254:208-214), and used in Examples 1 and 2. Cells were cultured for 30 mm 
in air at 37°C in depletion medium (25 mM Hepes/Tris (pH 7.5), 140 mM NaCl, 5.4 
mM KCl, 1.8 mM CaC12, 0.8 mM MgS04, and 5 mM glucose), to normalize 
intracellular nutrient pools before transport. The transport assays are initiated by 
replacing depletion medium with uptake medium (Depletion medium adjusted to pH 
6.0) that contained 100 ^M GlySar (5 ^iCi/mL, with [^H]-GlySar supplying 2.88% 
of total GlySar). After a 30 minute incubation period, transport was terminated with 
four rinses of 4°C depletion medium (pH 7.5). Two hundred and twenty |j,L of 10% 
trichloroacetic acid was added to each well, and the radioactivity of the supernatant 
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quantified by liquid scintillation counting. The cells of each well are solubilized in 
0.2 N NaOH/0.2% SDS and the protein quantified by using the modified Lowry 
assay, using bovine serum as a standard. Id. Peptide uptake will be reported as 
pmol* mg'^ protein* 30 mm\ Uptake measurements were taken after 24, 48, and 
5 72 hours of cultui*e in treatment media. 

Results. The previous research characterizing H^-dependent peptide 
transport by MDCK cells (Example 2 above) clearly showed that transport velocity 
is dependent on protein content. Therefore, to make a valid comparison of various 
treatment parameters on GlySar uptake, the protein content of compared treatment 
10 groups must not differ. Accordingly, the influence of culture media on MDCK 

cellular protein was evaluated (Figure 19). All media treatments supported cellular 
growth fi-om 1 to 3 d and no difference in protein content among treatments was 
observed. Similarly, no difference in uptake velocity (capacity) was observed 
among treatment groups, for any culture period (Figure 20). 

15 

Trial 2 

The results from Trial 1 suggest that either canine PepTl is not sensitive to 
substrate regulation or that the substrates and(or) stimulation time were inadequate 
to influence H''-dependent peptide uptake in MDCK cells. Again, DMEM was 

20 selected as the basal medium to allow the effect of individual peptides on peptide 
transport activity to be evaluated. To evaluate the latter two possibilities, a second 
trial was conducted that included a culture period of 9 d. GlySar was added as 
another potential affecter of H^-dependent peptide transport capacity because 10 
mM GlySar it is reported capable of stimulating increased PepTl activity (Adibi S. 

25 The oligopeptide transporter PepTl in human intestine: biology and function. 
Gastroenterology 1991 \ 1 13:332-340) in Caco-2 cells. GlyPro was added as a 
treatment because of its high content in muscle tissue, thus is hkely to be abundant 
in meat-based diets. 

Cell culture. The MDCK cell Une was maintained as described previously in 

30 the Methods section of Trial 1. Following initial and common culture conditions, 
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cells were cultured in DMEM, or DMEM that contained 10 mM GlySar, GlyPro, 
GlyPhe, or Camosine. Media were changed every 24 h. Media treatments (n = 8) 
were as follows: 
DMEM 

DMEM + lOmMGlySar 
DMEM+lOmMGlyPro 
DMEM+lOmM GlyPhe 
DMEM + 10 mM Camosine 

Uptake measurements. The measurement of [^HJGlysarcosine uptake was 
performed by using the 24-well cluster tray method as previously described in the 
Methods section of Trial 1. Peptide uptake will be reported as pmol* mg"^ protein* 
30 min \ Uptake measurements were taken after 4, 12, 24, 36, 72, 120, 168, and 
216 hours of culture in treatment media. 

Results. Protein content in all treatment groups increased linearly from 4 to 
216 h (9 d) of culture, for ail treatment groups (Figure 21). However, within a 
culture period, protein contents of treatment groups did not differ. Over the 216-h 
culture period, protein increased about 4.5 times, from about 40 to 220 )ag/well. In 
contrast to Trial 1 results, media treatment did influence GlySar uptake capacity 
(Figure 22). In addition, a treatment x time effect was observed that represents 
differences in the time of culture required for GlySar and camosine treatment 
stimulation of GlySar uptake capacity. Specifically, GlySar containing DMEM 
culture treatment resulted in an increase in GlySar uptake capacity of about 30% 
over DMEM control media by 24 h of culture time. This level of increase was 
maintained through 216 h. In contrast, culture in camosine-containing media did 
not result in a significant (23%) increase of GlySar uptake capacity over that by 
DMEM-cultured cells until 72 h of culture. This stimulation then steadily increased 
to 291% over 216 h of culture. The nature of stimulated uptake between the two 
peptide substrates also differed. That is, the magnitude of camosine-stimulated 
GlySar uptake was essentially constant from 72 to 216 h, whereas that for GlySar 
culture decreased during this period. Collectively, these data indicate that H"^- 
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dependent peptide transport in cultured MDCK cells can be stimulated by at least 
two of PepTl substrates, GlySar and camosine. 

Trials 

The data from Trial 2 indicate that H^-dependent GlySar uptake capacity by 
fed MDCK cells can be upregulated by the inclusion of 10 mM GlySar for at least 
24 h and 10 mM camosine for at least 72 h. It is of equal interest to understand if 
H''-dependent GlySar uptake capacity is sensitive to nutrient deprivation and(or) 
stimulation by glucocorticoids. A preliminary study indicates that fasting increases 
the expression of PepTl in rat small intestine epitheUa. Thamotharmi M, Bawani S, 
Zhou X, Adibi S. Functional and molecular expression of intestinal ohgopeptide 
transporter (PepTl) after a brief fast. Metabolism 1999; 48:681-684. 

To initiate investigation of potential influence of fasting and glucocorticoids 
on MDCK cells expression of GlySar uptake capacity, the H'^-dependent uptake of 
GlySar was evaluated over a 72 period of nutrient deprived or fed and cultured with 
dexamethasone (Dex) and compared to that by cells cultured in DMEM or DMEM 
that contained insulin (negative control) (Trial 3A). The "nutrient deprived" 
treatment actually contained 5 mM glucose and appropriate salts to ensure adequate 
basal metabolic conditions. 

Although recruitment of PepTl protein and activity appears sensitive to 
insulin-stimulated recruitment from cytosolic vesicles in Caco-2 cells (Thamotharan 
M, Bawani S, Zhou X, Adibi S. Hormonal regulation of oligopeptide transporter 
PepTl after a brief fast. Am J Physiol 1999; 276:C821-826, MDCK cells are 
reported to be insensitive to insulin, likely as an inability to express the insulin 
receptor. Hofinann C, Crettaz M, Bruns P, Hessel P, Hadawi G. Cellular responses 
ehcited by insulin mimickers in cells lacking detectable plasma membrane insulin 
receptors. J Cell Biol 1985; 27:401-414. In contrast to the lack of insulin 
sensitivity, IGF-I is known to stimulate DNA synthesis and cell proliferation in 
MDCK cells. Sukegawa I, Hizuka N, Takano K, Asakawa K, Shizume K. 
Characterization of IGF- 1 receptors on MDCK cell line. Endocrinol Japan 1987; 
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34(3):339-346. Mouzon SH, Kahn R. Insulin-like growth factor-mediated 
phosphorylation and proto-ontogeny induction in MDCK cells. Mol Endocrinol 
1991; 5:51-60. The understanding that MDCK cells are apparently insensitive to 
insulin stimulation yet are sensitive to IGF-I stimulation appears to be a paradox 
given that the supraphysiologic levels of both substrates employed in the perspective 
studies and the known ability of insulin to cross react with the IGF-I receptor. 
Accordingly, another trial (Trial 3B) was conducted to evaluate the influence of 
increasing IGF-I concentrations on H*-dependent GlySar uptake by MDCK of the 
same plating stock. 

Trial 3A: 

Cell culture. MDCK cells were maintained as described in Trial 1, except 
that cells were cultured for only 1 d before transport trials were performed. 
Following initial mid common culture conditions, cells were cultured in a "nutrient 
depleted" buffer (Hepes/Tris (pH 7.5), 140 mM NaCl, 5.4 mM KCl, 1.8 mM CaCl^, 
0.8 mM MgSOJ that contained 5 mM glucose as mi energy source, but that lacked 
amino acid or vitamin sources. In contrast, cells cultured in DMEM, or DMEM that 
contained 5 nM Dex, 500 nM Dex, 5 nM insulin, or 500 nM insulin, were 
adequately nourished. Media treatments (n = 4) were as follows: 
Nutrient depleted 

DMEM 

DMEM + 5 nM Dex 
DMEM + 500 nM Dex 
DMEM + 5 nM Insulin 
DMEM + 500 nM Insulin 

Uptake measurements. The measurement of [^H]Glysarcosine uptake was 
performed by using the 24-well cluster tray method as previously described in the 
Methods section of Trial 1. Peptide uptake is reported as pmol* mg ' protein*30 
min'\ Uptake measurements were taken after 30 min and 4 h of culture m treatment 
media. 
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Trial 3B: 

Trial 3B was conducted in the same manner as described for Trail 3 A, except 
that cells were cultured in DMEM or DMEM that contained 1 nM IGF-1, 5 nM IGF- 
1, 25 nM IGF-1, or 100 nM IGF-L Uptake measurements were taken after 30 min 
and 4 h of culture time. Media treatments (n = 4) were as follows: 

DMEM (pH 6 measurement) 

DMEM (pH 7.5 measurement) 

DMEM + lnMIGF-1 

DMEM + 5nMIGF-l 

DMEM + 25nMIGF-l 

DMEM + 100 nM IGF-1 

Results. Protein content of the treatments within Trails 3 A or 3B did not 
differ. After 4 h of culture, however, the capacity for Hf-dependent peptide uptake 
was reduced 35% in cells deprived of nutrients but adequate in energy (Figure 23). 
In contrast, dexamethasone had no effect on GlySar uptake. As expected, and 
consistent with the concept that MDCK cells are insulin-insensitive, the presence of 
insulin for 4 h had no effect on GlySar uptake capacity. Similariy, culture of cells 
with increasing amounts of IGF-I eUcited no significant stimulation of H^-dependent 
GlySar uptake (Figure 24). Quantitatively, however, 1 to 25 nM of IGF-I tended to 
increase GlySar uptake capacity by 10 to 15%. 

Given the noted restrictions of Trail 3, and the low number of observations 
(n = 4) results from trial 3 A and 3B suggest that iT-dependent uptake of GlySar by 
MDCK is sensitive to nutrient deprivation and, perhaps, IGF-I. 
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EXAMPLE 4 
PepTl Sequence 

Clonel2 (5* round; SEQ ID NO:ll) Primer Pair is GSP3-4;GSP3-1R using 
regular RT-PCR 

5 catcttcttcatcgtggtcaatgagttctgtgaaagattttcctactatggaatgagagcactcctgattctgtacttcagacg 
gttcatcgggtgggacgataatctgtccacggccatctaccacacgtttgtggctctgtgctacctgacgccgatcctcgg 
cgcactgatcgcagactcctggctgggaaagttcaagacaatcgtgtcactctccattgtctacacaattggacaggcggt 
cactgcagtaagctcaattaatgacctcacagactataacaaagatggaactcctgacaatctgtccgtgcatgtggcact 
gtccatgattggcctggccctgatagctctgggaactggaggaataaagccctgtgtgtctgcatttggtggagaccagtt 

10 tgaagagggccaggaaaaacaaagaaacagattcttttccatcttttatttggccattaatgctggaagcttgatttccactat 
tgtcactcccatgctcagagttcacgaatgtggaatttacagtcagaaagcttgttacccactggcatttggggttcctgctg 
ctctcatggccgtatctctgattgtatttgtcattggcagtggaatgtacaagaagtttcagccccagggtaatgtcatgggt 
aaagttgtcaagtgcattggttttgccctcaaaaataggtttaggcaccggagtaagcagtttcccaagagggagcactgg 
ctggactgggctaaagagaaatacgatgagcggctcatctctcaaattaagatggtcacaaaagtgatgttcttgtacatcc 

15 cactcccaatgttctgggccctgtttgaccagcagggctccaggtggacactgcaagcaacagctatgagtgggaaaatt 
ggacttcttgaagttcagccagatcagatgcagactgtgaatgccatcttgattgtcgtcatggtccccatcatggatgccg 
tggtgtaccctctgattgcaaaatgtggcttcaatttcacctccttgaagaggatgacagttggaatgttcctggcttccatgg 
ccttcgtgatggcggcgattgttcagctggaaattgataaaactcttccagtcttccccaaacaaaatgaagtccaaatcaa 
agtactgaatataggaaatggtgccatgaatgtatcttttcctggagcggtggtgacagttagccaaatgagtcaatcagat 

20 ggatttatgacttttgatgtagacaaactgacaagtataaacatttcttccactggatcaccagtcattccagtgacttataact 
ttgagcagggccatcgccatacccttctagtatgggcccccaataattaccgagtggtaaaggatggccttaaccagaa 

gccagaaaaagggag 

Amplification conditions 

25 





Initiale 


Denaturation 


Annealing 


Amplification 


Extension 


Cooling 




Denaturat 












Temp 
Min. 


94"C 
10 min 


94°C 
2min 


55'C 
1,5 min 


2 min 


72^C 
10 min 


4°C 
inf. 


Cycle 


I 




35 




1 
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Clone37 beginning (6* round; SEQ ID NO:12) Primer pair is GSP3-9; AUAP 
using 3'RACE Protocol 

gccatcgccatacccttctagtatgggcccccaataattaccgagtggtaaaggatggccttaaccagaagccagaaaa 
aggagaaaatggaatcagatttataaatagtcttaatgagagcctcaacatcaccatgggcgacaaagtttatgtgaatgtc 

5 accagtcacaatgccagcgagtatcagttcttttctttgggcacaaaaaacattacaataagttcaacacaacagatctcac 
aaaattgtacaaaagttctccaatcatccaaccttgaatttggtagtgcatatacctatgtaatcggaacgcagagcactggc 
tgccctgaattgcatatgtttgaagatatttcacccaacacagttaacatggctctgcagatcccgcagtacttcctcatcacc 
tgcggcgaggtggttttctctgtcacaggactggagttctcatattctcaggccccctccaacatgaagtcggtgcttcagg 
cgggatggctgctgacagtggcttgttggcaacatcattgtgctcattgtggcaggagcaggccagttcagtgaacagtg 

10 ggctgaatacatcctatttgcggcattgcttctggttgtctgtgtaatatttgccatcatggcccggttttacacttacgtcaatc 

cagcagagattg 



Amplification conditions 





Initiale 


Denaturation 


Annealing 


Amplification 


Extension 


Cooling 




Denaturat 












Temp 
Min. 


94°C 
10 


94°C 
2 min 


5TC 
1.5 min 


72°C 
2 


72^C 
10 


4'^C 
inf. 


Cycle 


1 




30 




1 





H 1 5 Merge Sequence (SEQ ID NO:8) is: 

S catcttcttcatcgtggtcaatgagttctgtgaaagattttcctactatggaatgagagcactcctgattctgtacttcagacgg 
p ttcatcgggtgggacgataatctgtccacggccatctaccacacgtttgtggctctgtgctacctgacgccgatcctcggc 
b gcactgatcgcagactcctggctgggaaagttcaagacaatcgtgtcactctccattgtctacacaattggacaggcggtc 
actgcagtaagctcaattaatgacctcacagactataacaaagatggaactcctgacaatctgtccgtgcatgtggcactgt 
20 ccatgattggcctggccctgatagctctgggaactggaggaataaagccctgtgtgtctgcatttggtggagaccagtttg 
aagagggccaggaaaaacaaagaaacagattcttttccatcttttatttggccattaatgctggaagcttgatttccactattg 
tcactcccatgctcagagttcacgaatgtggaatttacagtcagaaagcttgttacccactggcatttggggttcctgctgct 
ctcatggccgtatctctgattgtatttgtcattggcagtggaatgtacaagaagtttcagccccagggtaatgtcatgggtaa 
agttgtcaagtgcattggttttgccctcaaaaataggtttaggcaccggagtaagcagtttcccaagagggagcactggct 
25 ggactgggctaaagagaaatacgatgagcggctcatctctcaaattaagatggtcacaaaagtgatgttcttgtacatccc 
actcccaatgttctgggccctgtttgaccagcagggctccaggtggacactgcaagcaacagctatgagtgggaaaattg 
gacttcttgaagttcagccagatcagatgcagactgtgaatgccatcttgattgtcgtcatggtccccatcatggatgccgt 
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ggtgtaccctctgattgcaaaatgtggcttcaatttcacctccttgaagaggatgacagttggaatgttcctggcttccatgg 

ccttcgtgatggcggcgattgttcagctggaaattgataaaactcttccagtcttccccaaacaaaatgaagtccaaatcaa 

agtactgaatataggaaatggtgccatgaatgtatcttttcctggagcggtggtgacagttagccaaatgagtcaatcagat 

ggatttatgacttttgatgtagacaaactgacaagtataaacatttcttccactggatcaccagtcattccagtgacttataact 

ttgagcagggccatcgccatacccttctagtatgggcccccaataattaccgagtggtaaaggatggccttaaccagaa 

gccagaaaaaggagaaaatggaatcagatttataaatagtcttaatgagagcctcaacatcaccatgggcgacaaagttt 

atgtgaatgtcaccagtcacaatgccagcgagtatcagttcttttctttgggcacaaaaaacattacaataagttcaacacaa 

cagatctcacaaaattgtacaaaagttctccaatcatccaaccttgaatttggtagtgcatatacctatgtaatcggaacgca 

gagcactggctgccctgaattgcatatgtttgaagatatttcacccaacacagttaacatggctctgcagatcccgcagtac 

ttcctcatcacctgcggcgaggtggttttctctgtcacaggactggagttctcatattctcaggccccctccaacatgaagtc 

gotanttp.aggrffgffatffgr.tgrtgacagtggct fP-f^ggcaacarcarfg ^gcfeatfg/g^^ 
ca ^teaacaste92ctmatacatcctattteceecattscttctsstmctststaatattt2ccatcat^^^ 

acacttacstcaatccascagasatts 
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After analyzing the protein sequence and performing alignment with other species, the underlined, 
italicized was removed for submission to Genbank. 



5 Sequence to submit to Genbank (SEQ ID N0:7) 

catcttcttcatcgtggtcaatgagttctgtgaaagattttcctactatggaatgagagcactcctgattctgtacttcagacgg 

ttcatcgggtgggacgataatctgtccacggccatctaccacacgtttgtggctctgtgctacctgacgccgatcctcggc 
gcactgatcgcagactcctggctgggaaagttcaagacaatcgtgtcactctccattgtctacacaattggacaggcggtc 
actgcagtaagctcaattaatgacctcacagactataacaaagatggaactcctgacaatctgtccgtgcatgtggcactgt 
10 ccatgattggcctggccctgatagctctgggaactggaggaataaagccctgtgtgtctgcatttggtggagaccagtttg 
aagagggccaggaaaaacaaagaaacagattcttttccatcttttatttggccattaatgctggaagcttgatttccactattg 
tcactcccatgctcagagttcacgaatgtggaatttacagtcagaaagcttgttacccactggcatttggggttcctgctgct 
ctcatggccgtatctctgattgtatttgtcattggcagtggaatgtacaagaagtttcagccccagggtaatgtcatgggtaa 
agttgtcaagtgcattggttttgccctcaaaaataggtttaggcaccggagtaagcagtttcccaagagggagcactggct 
15 ggactgggctaaagagaaatacgatgagcggctcatctctcaaattaagatggtcacaaaagtgatgttcttgtacatccc 
Cj actcccaatgttctgggccctgtttgaccagcagggctccaggtggacactgcaagcaacagctatgagtgggaaaattg 
;j; gacttcttgaagttcagccagatcagatgcagactgtgaatgccatcttgattgtcgtcatggtccccatcatggatgccgt 

Isssl 

ry ggtgtaccctctgattgcaaaatgtggcttcaatttcacctccttgaagaggatgacagttggaatgttcctggcttccatgg 
f=i ccttcgtgatggcggcgattgttcagctggaaattgataaaactcttccagtcttccccaaacaaaatgaagtccaaatcaa 
^ 20 agtactgaatataggaaatggtgccatgaatgtatcttttcctggagcggtggtgacagttagccaaatgagtcaa^ 

M: ggatttatgacttttgatgtagacaaactgacaagtataaacatttcttccactggatcaccagtcattccagtgacttataact 
ttgagcagggccatcgccatacccttctagtatgggcccccaataattaccgagtggtaaaggatggccttaaccagaag 
ccagaaaaaggagaaaatggaatcagatttataaatagtcttaatgagagcctcaacatcaccatgggcgacaaagtttat 
gtgaatgtcaccagtcacaatgccagcgagtatcagttcttttctttgggcacaaaaaacattacaataagttcaacacaac 
25 agatctcacaaaattgtacaaaagttctccaatcatccaaccttgaatttggtagtgcatatacctatgtaatcggaacgcag 
agcactggctgccctgaattgcatatgtttgaagatatttcacccaacacagttaacatggctctgcagatcccgcagtactt 
cctcatcacctgcggcgaggtggttttctctgtcacaggactggagttctcatattctcaggccccctccaacatgaagtcg 
gtgcttcaggcgggatggctgctgacagtggct 



30 Canine PepTl Nucleotide Sequence (SEQ ID NO:20) 

atgggcatgtccaagtcatatggttgctttggttaccccttgagcatcttcttcatcgtggtcaatgagttctgtgaaagatttt 
cctactatggaatgagagcactcctgattctgtacttcagacggttcatcgggtgggacgataatctgtccacggccatcta 
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ccacacgtttgtggctctgtgctacctgacgccgatcctcggcgcactgatcgcagactcctggctgggaaagttcaaga 

caatcgtgtcactctccattgtctacacaattggacaggcggtcactgcagtaagctcaattaatgacctcacagactataa 

caaagatggaactcctgacaatctgtccgtgcatgtggcactgtccatgattggcctggccctgatagctctgggaact^^ 

aggaataaagccctgtgtgtctgcatttggtggagaccagtttgaagagggccaggaaaaacaaagaaacagattctttt 

ccatcttttatttggccattaatgctggaagcttgatttccactattgtcactcccatgctcagagttcacgaatgtggaatttac 

agtcagaaagcttgttacccactggcatttggggttcctgctgctctcatggccgtatctctgattgtatttgtcattggcagt 

ggaatgtacaagaagtttcagccccagggtaatgtcatgggtaaagttgtcaagtgcattggttttgccctcaaaaataggt 

ttaggcaccggagtaagcagtttcccaagagggagcactggctggactgggctaaagagaaatacgatgagcggctca 

tctctcaaattaagatggtcacaaaagtgatgttcttgtacatcccactcccaatgttctgggccctgtttgaccagcagggc 

tccaggtggacactgcaagcaacagctatgagtgggaaaattggacttcttgaagttcagccagatcagatgcagactgt 

gaatgccatcttgattgtcgtcatggtccccatcatggatgccgtggtgtaccctctGattgcaaaatgtggcttcaatttca 

cctccttgaagaggatgacagttggaatgttcctggcttccatggccttcgtgatggcggcgattgttcagctggaaattga 

taaaactcttccagtcttccccaaacaaaatgaagtccaaatcaaagtactgaatataggaaatggtgccatgaatgtatctt 

ttcctggagcggtggtgacagttagccaaatgagtcaatcagatggatttatgacttttgatgtagacaaactgacaagtat 

aaacatttcttccactggatcaccagtcattccagtgacttataactttgagcagggccatcgccatacccttctagtatggg 

cccccaataattaccgagtggtaaaggatggccttaaccagaagccagaaaaaggagaaaatggaatcagatttataaat 

agtcttaatgagagcctcaacatcaccatgggcgacaaagtttatgtgaatgtcaccagtcacaatgccagcgagtatcag 

ttcttttctttgggcacaaaaaacattacaataagttcaacacaacagatctcacaaaattgtacaaaagttctccaatcatcc 

aaccttgaatttggtagtgcatatacctatgtaatcggaacgcagagcactggctgccctgaattgcatatgtttgaagatat 

ttcacccaacacagttaacatggctctgcagatcccgcagtacttcctcatcacctgcggcgaggtggttttctctgtcaca 

ggactggagttctcatattctcaggccccctccaacatgaagtcggtgcttcaggcgggatggctgctgacagtggctgtt 

ggcaacatcattgtgctcattgtggcaggagcaggccagttcagtgaacagtgggctgaatacatcctatttgcggcattg 

cttctggttgtctgtgtaatatttgccatcatggcccggttttacacttacgtcaatccagcagagattgaagctcagtttgacg 

acgatgagaaaaagaacctggaaaagatgaatgtatattccacggtaactccggtctcacagacacagatg 

Canine PepTl Amino Acid Sequence (SEQ ID NO:21) 

MGMSKSYGCFGYPLSIFFIVVNEFCERFSYYGMRALLILYFRRFIGWDDNLS 

TAIYHTFVALCYLTPILGALIADSWLGKFKTIVSLSIVYTIGQAVTAVSSINDL 

TDYNKDGTPDNLSVHVALSMIGLALIALGTGGIKPCVSAFGGDQFEEGQEK 

QRNRFFSIFYLAINAGSLISTIVTPMLRVHECGIYSQKACYPLAFGVPAALMA 

VSLWFVIGSGMYKKFQPQGNVMGKVVKCIGFALKNRFRHRSKQFPKREH 

WLDWAKEKYDERLISQIKMVTKVMFLYIPLPMFWALFDQQGSRWTLQATA 

MSGKIGLLEVQPDQMQTVNAILIVVMVPIMDAVVYPLIAKCGFNFTSLKRM 

TVGMFLASMAFVMAAIVQLEIDKTLPVFPKQNEVQIKVLNIGNGAMNVSFP 

GAWTVSQMSQSDGFMTFDVDKLTSINISSTGSPVIPVTYNFEQGHRHTLLV 

WAPNNYRWKDGLNQKPEKGENGIRFINSLNESLNITMGDKVYVNVTSHN 

ASEYQFFSLGTKNITISSTQQISQNCTKVLQSSNLEFGSAYTYVIGTQSTGCPE 

LHMFEDISPNTVNMALQIPQYFLITCGEVVFSVTGLEFSYSQAPSNMKSVLQ 

AGWLLTVAVGNIIVLIVAGAGQFSEQWAEYILFAALLLWCVIFAIMARFYT 

YVNPAEffiAQFDDDEKKNLEKMNVYSTVTPVSQTQM 

All publications, patents and patent documents are incorporated by reference 
herein, as though individually incorporated by reference. The invention has been 
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described with reference to various specific and preferred embodiments and 
techniques. However, it should be understood that many variations and 
modifications may be made while remaining within the scope of the invention. 



104 



